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ITERATIVE RECONSTRUCTION AND STABILITY 
BOUNDS FOR SAMPLING MODELS 

ERNESTO ACOSTA-REYES 

Abstract. This paper studies the reconstruction of a function 
/ belonging to a shift-invariant space from the set of its non- 
uniformly distributed local sampled values. Here it is shown that 
if the set of sampling X — {xj}j & j satisfies a necessary density 
conditions, then we can recover the function / from the set of its 
samples geometrically fast using an iterative algorithm. In addi- 
tion, the algorithm is analyzed when the data is perturbed by noise, 
and it is proved that a small perturbation on the set of samples 
causes only a small change of the original function. Moreover, it 
is given an upper estimate of the rate of convergence of the algo- 
rithm. On the other hand, if we assume that X is a separated set, 
then it is shown that A is a set of sampling and explicit stability 
bounds are given. 



1. Introduction 

It is well known that in Sampling Theory there are two main goals 
(for an overview see [2], [4], [7]- [8], and [11]): First, given a class of 
functions on M. d , to find conditions on the sampling set X = {xj}j eJ , 
where J is a countable index set, under which a function belonging to 
that class can be reconstructed uniquely and stably from its samples 
{f{xj)}jeJ- Second, to find efficient and fast numerical algorithms for 
recovering the function from its samples on X. 

It is unrealistic to assume that the samples {f(xj)}jeJ can be mea- 
sured exactly. For working with a more realistic model, we consider 
that our function (signal) belongs to a shift-invariant space V p (<&), for 
some 1 < p < oo, of the form 

(1.1) \/ p ($) = { J2 c k®k : C e (F(Z d ))^}, 

keZ d 
and that the samples of the signal have the form 

9xAf) = / f(x)dti x Ax), 



Key words and phrases. Irregular sampling, Non-uniform sampling, Reconstruc- 
tion, Fast algorithm, Shift-invariant spaces, Stability bounds. 
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ERNESTO ACOSTA-REYES 

where \1= {fi x }jej is a collection of finite complex Borel measures 
on IR d that acts on the signal / in a neighborhood of Xj to produce 
the data {g x (f)}jeJ- The form of our sampled data generalizes the 
model presented by A. Aldroubi in [2] when for each j G J the Radon- 
Nikodym derivative of \x x . with respect to the Lebesgue measure on IR rf 
belongs to L 2 (IR d ). On the other hand, if the collection /J, consists of 
Dirac measures on M d concentrated at each point of X, then we obtain 
the model presented by A. Aldroubi and K. Grochenig in [4]. 

In this paper we apply an iterative algorithm for recovering the signal 
/ from its samples values {g x -(f)}jeJ which uses the density proper- 
ties of the set X, the support size conditions of the collection fl, and 
the properties of the generator $ for V p (<&). Here we show that the 
sequence of functions generated using the algorithm converges to / ge- 
ometrically fast. In [12], [14]-[18], this method was used for iterative 
reconstruction of band-limited signals, in [2] and [4], it was used for re- 
constructing functions belonging to shift-invariant spaces, and in [17] 
it was used for reconstructing signals belonging to a weighted multiply 
generated shift-invariant spaces. On the other hand, if X is assumed 
to be a separated set, then we show that X is also a set of sampling for 
V p (&) and /J,, and we give explicit stability bounds in terms of the rate 
of convergence of the algorithm, the generator for V P (Q), the bounded 
projection from L p (M. d ) onto V p (&), and the uniform upper bound for 
the total variations of the collection /J,. Moreover, it is given an upper 
estimate for the rate of convergence of the iterative algorithm. 

The stability of the samplingreconstruction is analyzed when our 
local sampled data is perturbed by noise, and we show that a small 
perturbation of the sampled data {g x (f)}jej m the £ P (J) norm pro- 
duces a small perturbation of our original function. 

The remainder of this paper has been organized as follows. Section 2 
introduces our sampling model, the definitions and notations that we 
shall work in this paper. The main results are presented in section 3, 
and we provide the proof of some of the results in section 4. 



2. Notation and preliminaries 

In this section is introduced the sampling model we use in this paper, 
and the notations that will be used later. 

The functions we are dealing with in this paper are functions / G 
L p (R d ), for some p G [1, oo] and d G N, which belong to a shift invariant 
space defined in (1.1), where $ = ((f) 1 , . . . , r ) T is a vector of functions, 
$fc = $(• — &), and C = (c 1 , . . . , c r ) T is a vector of sequences belonging 
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to (£ p (7j d ))( r \ Among the equivalent norms in (£ p (Z d ))( r ) we choose 

r 

l|C||(£p(z d ))(r) = 2_^ \\ c% \\&>(z d )- 



i=\ 



Here it is assumed that the set {0 1 (- — k), . . . , r (- — k); k G Z d } gen- 
erates an unconditional basis for V P (Q). In particular, we require that 
there exist constants < m p < M p < oo, such that 

(2.1) 

m p \\C\\ mzd))(r) < II J2 C >* r „ < K\\C\\ m z d) )(r), VC G (F(Z d )f\ 



kei 



LP 



The unconditional basis assumption (2.1) implies (see Theorem 2.4 
in [4]) that the space V p (&) is a closed subspace of L p (W d ). 

Since we are interested in sampling in V p (<&) we add an assump- 
tion that would make all the functions in these spaces continuous and, 
therefore, pointwise evaluations will be meaningful. Hence, we assume 
that the generator $ belongs to a Wiener- amalgam space (Wq)^ as 
defined below. For 1 < p < oo, a measurable function / belongs to W p 
if it satisfies 

(2.2) \\f\\w»= ( y>sssup|/(x + A;)|M < oo. 

If p = oo, a measurable function / belongs to W°° if it satisfies 

(2.3) ll/H w°° = supjesssup \f(x + k)\} < oo. 

k&z d xe[o,i] d 

Hence, W°° coincides with L°°(IR d ). It is well known that for p G [l,oo], 
W p is a Banach space ( see [9]-[10]), and clearly W p C L p . By (Pv" p ) (r) 
we denote the space of vectors \l/ = (ip 1 , . . . , ip r ) T of IF p -functions with 
the norm 



I*II(wp)M — / , 



II wp- 
i=i 

The closed subspace of (vectors of) continuous functions in W p (re- 
spectively, (W p )^) will be denoted by Wg (or ( W p ) (r) ). 

In this paper we are interested in average sampling performed by a 
countable collection of measures. We denote by Ai(WL d ) the Banach 
space of finite complex Borel measures on IR d . The norm on M. (JxL d ) is 
given by ||/i|| = f Rd d\fi\(y), i.e., the total variation of a measure \i. 

Let J be a countable index set and X = {xj : j G J} be a subset 
of IR d . The reconstruction problem in our sampling model consists of 
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ERNESTO ACOSTA-REYES 
finding the function / G V P (Q) from the knowledge of its samples 

9xjtf) = / f(x)dfi x .(x)\ , 

jRd > jeJ 

where fl= {fi Xj }j £ j is a countable collection of finite complex Borel 
measures on M d satisfying the following properties: 

(1) There exists a > such that supp^. C Xj + [—a,a] d , for all 

3 e J, 

(2) There exists M > such that H/^rH < M, for all Xj G X; and 

(3) f Rd d\x Xi = 1, for all j G J. 

Definition 2.1. Let 1 < p < oo and X = {xj : j G J} be a countable 
subset of R d . We say that X is a set of sampling for V P (Q) and y[i= 
{/^ . } je j if there exist constants < A p < B p < oo such that 

(2.4) A p \\f\\ LP < \\{g Xj (f)}\U P{J) < B p \\f\\ LP , for all / G V p ($). 
A p and i? p are called the stability bounds. 

Remark 2.1. If in the above definition we let p = 2, then applying 
Riesz representation theorem, it follows that (2.4) is the definition of 
frame. Thus, / can be reconstructed from its samples via dual frame 
expansion. 

Definition 2.2. We say that X = {xj}j$j C lR d is separated if there 
exists 5 > such that inf ijeJ,i^j \ x i ~ x j\ > ^- The number 5 is called 
the separation constant of the set X. 

Definition 2.3. A set X = {xj : j G J} C R d is 7- dense in R d if 

R<* = |j£ r (^), Vr> 7 , 
:i 

where B r (xj) = Yli=A xl j ~ r i x \ + r )- 

Definition 2.4. A bounded partition of unit adapted to {B 1 (xj)}j e j 

and associated with the sampling set X is a set of functions {(3j}jeJ 
that satisfies: 

(1) 0<Pj< 1, Vj G J; 

(2) supp/9j C B 1 (xj); and 

(3)E i6J ^- = i- 

Given a bounded partition of unit {j3j}jej associated with the sam- 
pling set X, we define the operator Ax on V p ($) as follows 

(2.5) A x / = £ &*(/)&■ 
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The quasi-interpolant operator Q x is defined on sequences c = {cj}j e j G 

nj) by 

(2.6) Qxc = 2>&. 

jeJ 

If f e Wg, we write 

(2-7) Qx / = £ /(*,■)& 

for the quasi-interpolant of the sequence Cj = f(xj). 

Remark 2.2. Note that if \i x . — 8 X ., for all j G J, where 5 X . is the Dirac 
measure on IR d concentrated at Xj, then Ax = Qx- 

3. Main Results 
In this section we collect the main results of our paper. 

Theorem 3.1. Let $ G (Wq)^\ 1 < p < oo ; and P be a bounded 
projection from L p (R d ) onto V P (Q). Then there exists a density 70 = 
7o( < l ) ,P,£>) > 0, anda = a ($, P,p) > such that every f G V p ($) can 
be recovered from the data {g x (f)}j<=j on any j— dense set X = {xj}j e j 
(0 < 7 < 7oy) /or any support size condition (for LL) < a < clq by the 
following iterative algorithm: 

(3.1) /i=PA x /, /n+i=P A x (/„-/) + /„■ 

In t/iis case ine sequence {f n }n>i converges to f in the W p norm, hence 
both in the L p (WL d ), and uniformly. The convergence is geometric, that 

is, 

||/n - f\\LP(Rd) < ||/n - /|| Wp < C p a n \\f\\ W p, 

for some a = a(P,7,a,$,p) < 1, and for some < c p < 00 indepen- 
dent of f and n G N. 

Remark 3.1. Notice that since $ G ( W^ 1 )^' then by Theorem 6.2 in [4] 
the existence of a bounded projection P is guaranteed for all p G [1, 00], 
and in this case it is given by P/ = ^2 keIj d(f, $(• — &))*£(• — k), where 
{&k\kei d is the canonical dual Riesz basis associated to {$k}k&z d - Here 

(/,*) = «/,£),..., </,^» e c\ (/,0*> = J Rd /(z) M*j<fe, for 1 < 
i < r, and 2; denotes the complex conjugate of z. 

Remark 3.2. Note that Theorem 4.1 in [2] is a Corollary of Theorem 
3.1 when we let p — 2 and r — 1. 
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Next result shows that if the hypothesis of Theorem 3.1 takes place, 
and X is also a separated set, then we obtain that X is a set of sampling 
for V p (&) and LI, and explicit stability bounds are given. 

Theorem 3.2. Let $ G (Wq)^ be given. Assume that X is separated 
with separation constant 5 > 0, and P is a bounded projection from 
L p (R d ) onto \/ p ($). Then the following hold: 

(1) Given f G V p (§), the sequence {f n }n>i defined by the algorithm 
(3.1) satisfies 

(3.2) ||/„|| LP(Rd) < ( 1+ 1 "~ a ) II/IIlpcr"), Vn > 1, 

where a is rate of convergence of the algorithm (3.1). 

(2) X is a set of sampling for V p (<&) and Li with stability bounds 
given by 

(3.3) A l ~ a 
and 

(3.4) B p 



'' 3 rf ||P|| op ATW 



MN 1/p ?> dlp \\<$>\ 



(pyi)M 



m p 



where M = Af(5,p,d) = ([^] + l) d , l + h = 1, [t] denotes the 
biggest integer lower than or equal to t, m p is the lower bound 
constant in condition (2.1), || P \\ op is the operator norm ofP, 
and M > is the uniform upper bound for the total variations 
of the elements in the collection LI. 

As a consequence of Theorem 3.1 and its proof, we obtain the fol- 
lowing result which allows to find an estimate for the values of 7 and a 
needed for the reconstruction algorithm (3.1). Moreover, Theorem 3.3 
provides an upper estimate of the rate of convergence of the algorithm. 

Theorem 3.3. Assume that $ G (W^ and |V$| G {W^) ir) , where 
|V$| = QV0 1 !, . . . , |V0 r |) T , andVcj) 1 is the gradient of(j) % fori < i < r. 
Let M > be such that \\fJ- x -\\ ^ M, for all j G J, and P be a bounded 
projection from L p (WL d ) onto l /p ($). Then we have the following upper 
estimate for the rate of convergence of the algorithm (3.1): 

a < ^k( 7( i + 2 M )^ + M((l+2[ 7 l) rf +2)a((l+2H) d ))|||V$||| (v , 1)M , 

TTLp 

where m p is the lower bound constant given in (2.1), and \i\ denotes 
the smallest integer bigger than or equal to t . 
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3.1. Reconstructing in presence of noise. 

Now we investigate the algorithm (3.1) in the case of noisy samples 
{fAjeJ S £ P {J), but we do not assume that and {fAj^j are samples 
of a function / G V p (&). Then given {/3j}jzj, a bounded partition of 
unit associated with X, we use the initialization: 

(3.5) /i = PQx {/,'}, /n+i = /i + (I-PAx)/„, Vn>l, 
and we have the following result. 

Theorem 3.4. Let $ G (W^ r \ {^'} ie j G £ P (J), and P a bounded 
projection from L p (M. d ) onto V p (&) be given. Then the algorithm (3.5) 
converges to a function f^ G V p (§), which satisfies P Ax foo = P Qx {/,•}• 

As a consequence of Theorems 3.1 and 3.4, the next result shows the 
stability of the sampling-reconstruction. 

Theorem 3.5. Let $ G ( Wq)^\ P a bounded projection from L p (W l ) 
onto V p (<&) be given, and assume the X is a separated set. Let {fj}jeJ £ 
£ P (J), and f G V p ($) with sampled values {gxj{f)}jeJ be given. Then 
the following holds: 

^ d \f 1 /p\\ P II 

(3.6) ||/-/oo||lp < \\ U) \\{g X] U)-f 3 }Ujh 

where tf =([&] + !)*, \ + \ = 1, a = ||I-PA X |U foo e V p (<5>) is 



i> 



the function given in Theorem 3.4, and 5 > is the separation constant 
of the set X . 

4. Proofs 

4.1. Auxiliary results. 

We begin this section with three results that are needed for the main 
proofs. 

The next Lemma collects basic facts about Wiener amalgam spaces, 
and shift-invariant spaces. For a proof of this Lemma see Proposition 
4.2 in [1]. 

Lemma 4.1. Let $ G (W 1 )^ , / = E C k®k, where C G (£ p (Z d )Y' r \ 

kei d 
and $,t = $(• — k), for all k G Z d . Then the following hold: 

(4.1) \/ p ($) C W p , for all 1 < p < oo, 

i/$e(Wtf)M. 

(4-2) 11/11 wp < l|C||(^(Z'*))w||^ , ll(i^ 1 )W- 
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We also need the following Lemma which will be stated without proof 
(see Lemmas 5.1 and 5.2 in [2], and Lemma 8.1 in [4] and the references 
therein) . 

Lemma 4.2. Let $ G (Wtf)^, and f = £ CT$ k , where C G 



k&L d 



(£P(Z d )Y r \ Then: 

(1) The oscillation osc 7 (/) belongs to W p . 

(2) The oscillation osc 7 $ satisfies 

(4.3) || osc 7 $|| (H/1)W < ((1 + 2[ 7 ]) d + 1)||$|| (H/1)W , 

and || osc 7 $||(^i)M — ► as 7 — >■ 0. 

(3) 7/| v^l e WY r \ then 

(4.4) ll$ll (H /i)M < 7(2M + l) d ||| V $|||(w")M 

(4) The oscillation osc 7 (/) satisfies 
(4.5) 

||osc 7 (/)||^ < ||C|| ( , P(zd))W ||osc 7 $|| (wl)W , yC e (F(Z d )Y r \ 

In particular, || osc 7 (/)|| \y P — > as 7 — > 0. Moreover, 
(4.6) 
llQx/HiP < HQx/llw* < ((l+2[7l) d +2)||C|| ( ^ d))M ||<l»|| (H ,i)«, VC 6 (f (Z d ))M. 

Lemma 4.3. Let $ G ( Wq)^ be given. Let P be a bounded projection 
from L p (M. d ) onto V p {<&). Then there exist 70 = 7o($,P,p) > ; and 
00 = a o( < l ) ) P,p) > snc/i that for anyO < a < a^, the operator I — P Ax 
is a contraction on V P (Q) for any 7— dense set X with < 7 < 70. 

Proof. Let P be a bounded projection from L p (R d ) onto V p ($), and 

/ = E Cf ^ where C G (F(Z d ))M be given. Then 

feez d 



'jW 



-A x ) 



\f(x)-(Qxf)(x)\ = \f(x)-J2f(x 3 )fr 

= I £(/(*)- /(*;))& 

< £i/(*)-M-)i#(aO 

< osc 7 (/) (x) ^ Pj (x) = osc 7 (/) (x) . 
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From this pointwise estimate and (4.5) we obtain 

||/-Qx/||lp < ||/-Qx/||w> 

< || OSC 7 (f)\\wP < l|C||(^(Z d ))('')|l OSC 7 < ^ll(W 1 )('') 



< 



1 



m r 



LP || OSC 7 $||( H /l)(r), 



where we have used condition (2.1) in the last inequality. Consequently, 

1 



(4.7) 



||/-Qx/||lp< 



m. 



LP 1 1 OSC 7 $||(^l^(r). 



On the other hand, 
|(Qx/-Ax/)(a;) 



= | £(/(**) -M/))&(* 

= |E(/ (f^ J )-m)dn Xj (z))f3 j (x 

< E/ IM-)-/(«)|d|Mxil(^(*) 

< e^^/x^)/?^) / d W(^) 






x 



^ M E ( E E i c ^i osc « (^)fo - fc ))/^)- 

By using Lemma 4.2, condition (2.1), Triangular inequality, and the 
above pointwise estimate, we have 

(4.8) ||Q x /_Ax/||lp<— ((l + 2[ 7 l) d + 2)||osc a $|| (H , 1)M ||/||Lp. 

Since / G V p (<5>), then P / = /. Therefore, 

HZ-PAx/Hip < ||P/-PQx/||lp + ||PQx/-PAx/||lp 

< II P \\o P \\f - Qx /||lp + || P IUH Qx / - A x /||lp. 

Using now (4.7), (4.8), and the above inequality we get 
(4.9) 

||/-pa x /||lp< " P 



op 



in,, 



osc 7 $|| (H/ i )M +M((l+2[7]) c( +2)||osc a <l>|| (w ,i )M 



LP- 
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Let < e < ,, rap be given. Since || osc 7 $|L w i\(r) — > as 7 — »■ + , 
then there exists 70 = 7o(e, <3>, P,p) > 0, and ao = a (e, $, P,p) > 
such that 

|| osc 7 <&|| (W ri)(r) < -, for all < 7 < 70, 
and 

M((l + 2[ 7 l) d + 2)||osc a $|| (w , 1)W < e -, for all < a < a . 

If we choose 70 and ao so that for any < 7 < 70, and < a < a we 
have 

||/-PA X /||i><^*||/||LP, 

then the conclusion of the Lemma follows. □ 

4.2. Proofs for Section 3. 

Now we are ready to prove our main results. 

Proof of Theorem 3.1. 

Proof. Let e n = f — f n be the error after n iterations of the algorithm 
(3.1). Then the sequence {e n }„ e jj satisfies 

e„+l = / - fn+l = f -fn- P A X (/ - fn) 

= (I-PA x )(/-/ n ) = (I-PA x )(e n ). 

By using Lemma 4.3, there exist a density 70 > 0, and ao > such 
that for any < 7 < 70, and 0<a<a ,I — P Ax is a contraction on 
V P (Q). Therefore, by taking a := \\ I — P Ax|| op < 1, we have 

||e n +i||i> < a||e n ||i>, 
and by induction it follows that 

(4.10) ||e n+1 || LP <« n+1 ||/|U P , 

and ||e n ||x,p — >• geometrically fast. Since for V P (Q) the W p norm, and 
the IP norm are equivalent, then (4.10) also holds in the W p norm and 
uniformly on M. d , and Theorem 3.1 is proved. □ 

Proof of Theorem 3.2. 

Proof. Let us prove (3.2). Note that by hypothesis and Lemma 4.3 we 
have that there exists 70 > such that I — P Ax is a contraction for 
any 7— dense set X with < 7 < 70 . Hence, a — \\ I — P A x || p < 1, 
and thus, the operator PAx is invertible on V p (&). It is not hard to 
show that PAx and (P Ax) -1 satisfy: 

(4.11) l-a< ||PA x || op < 1 + a, 
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and 

(4-12) t^^IKPAx)- 1 !!^ 



op 

a i — a 



Let / G V p (&) be given. Since {f n }n>i given by algorithm (3.1) sat- 
isfies f n = f\ + e 1 + e 2 + . . . + e n _i for n > 2, / x = P A x /, and since 
{e n }„>! satisfies e n = (I — P A x )e n _i, for n > 1, then we get 



/„ = / 1 + ^(i-PA x y/ 

Hence, 

ll/nllip < II/i||lp + II/IUp>;||i-pa 

< (||PA x ||o P + 



i=l 






J=l 



n 

^ «' J 11/ Ik" 



i=l 
/ a \ ,, „., /I + a — a 2 

- ( 1+a+ r^^J ll/llLP = { i- a y ||j||Lp ' 

and we obtain (3.2). Let us show (3.3). Let / G V P (Q) be given. Then 
by Lemma 4.3, there exists 70 > such that the operator I — P Ax is a 
contraction on V p ($) for any 7— dense set X with < 7 < 70. Hence, 
a = 1 1 1 — P Ax 1 1 op < 1, the operator PAx is invertible, and (4.12) 
takes place. On the other hand, from the definition of the operators 
Ax and Qx, it follows that Ax/ = Q,x{9xj(f)}, and thus, PAx/ = 
PQx {<?,,(/)}■ Therefore, / = (PAx)" 1 ? QxK(/)}- Consequently, 

II/IIlp < ll(PAx)- 1 ||o P ||P||o P ||Qx||o P |IK(/)}||^(j) 
II p ll ll n ll 

, M 7 \\op SJX \\op II r /(-Mil 

< y^x IK^-(/)}lkn^- 

In order to complete the proof of (3.3), we need an upper estimate for 
|| Qx \\o P - Let x be the characteristic function of the set -B 7 (0) + [0, l] d . 
Clearly, we may assume without loss of generality that < 7 < 1. 
Since < j3j < 1, and supp/J,- C B^(xj), then for all Xj G k + [0, l] d , 
Pj{ x ) < xi x ~ k)- Therefore, 

|Qxc| = 5^Cj^-(x) < ^2 ( Yl \ c j\)x(x-k), 

jeJ k£Z d j:x j £k+[0,l] d 
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and by (4.2) in Lemma 4.1, we have 



Qxc\\ W p < ( E ( E 



i/p 

_ c j\) ! 1 1 \ 1 1 1 v ->- - 

ke1 d j:Xj&k+[Q,l} d 



Since X is a separated set with separation constant S > 0, then there 
are at most M = M{5,p,d) = ([$] + l) d sampling points Xj in each 
cube k + [0, l] d . By applying Holder's inequality, we get 



( E m)'<*»*' E 

j':xjefc+[0,l] d j:x J gA:+[0,l] d 



\c-\ p 



where - + — = 1. Consequently, 

||Qxc|| W p <A/" 1/p IKcjXI^^HxIIw 1 - 
We leave to the reader the proof of || x|| w 1 = 3 d - Therefore, 

||Qxc|U P <3 d A/- 1/p '||{ Cj }|| £P(J) , 
and 

(4.13) ||Qx|| op <3^ 1/p '. 

Hence, 

^ir^ir^Tw 11/11^ ^ IIK(/)}llw» fora11 /em 

"-> II " II opJv 

Let us show (3.4). Note that 

E M/)i p = El/ /(*)<^(* " 

— — I /rod 

a; i efc+[0,l] d x j Gfc+[0,l] d 



* E IM>( / l/WI%T V 

^■^ \ /rod U T 

x,-6fe+[0,l] d ^ M llrXj " 

< E ikii p / i/(*)i° dW( * 



x.,efc+[0,l] d H^^H 

< m p e esssu p \f( z )\ p - 

Xj ek+lo,i]d zex ^- a ^ d 
Since X is a separated set, then there exist at most M = J\f(8,p,d) = 
([ s ] + ^) d sampling points in each cube k + [0, l] d . Assuming without 
loss of generality that < a < 1, then 

E M/)I P < M^Afesssup \f(z)\P. 

Xj ek+[o,i]* ,efc+[o,i]d 
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Consequently, by taking the sum over k G Z d in the above inequality, 
we obtain: 

< MM /P 3 /P ||C , ||( £ p( Z d)- ) (r)|| < l ) ||( H /l)(r) 

< ^"^^r, 

m p 
and Theorem 3.2 is proved. D 

Proof of Theorem 3.3. 

Proof. The proof of Theorem 3.3 is a straightforward consequence of 

(4.3), (4.4), and (4.9). □ 

Proof of Theorem 3.4. 

Proof. Assume the hypothesis of Theorem 3.4 holds. From Lemma 
4.3, the operator I — P Ax is a contraction. Consequently, the sequence 
{fn}n>i defined by algorithm (3.5) converges to a function /^ G V p ($). 
By taking limits in both sides of (3.5) as n — > oo, we have: 

/oo = /l + (I-PA X )/oo. 

Therefore, /i — P A x /oo = 0. Taking into account that f\ = P Qx {fj}, 
then the conclusion of Theorem 3.4 follows. □ 

Proof of Theorem 3.5. 

Proof. Assume that the hypothesis of Theorem 3.5 holds. By Lemma 
4.3, there exists 70 > such that the operator I — P Ax is a contrac- 
tion on V p (&) for any 7— dense set X with < 7 < 70. Hence, 
a = 1 1 1 — P Ax 1 1 op < 1, the operator PAx is invertible, and (4.12) 
takes place. On the other hand, from the definition of the operators 
A x and Qx, it follows that A x / = Qxidx (f)}, and thus, PA X / = 
PQxK(/)}- Therefore, / = (PA X )^PQ X {^.(/)}. By apply- 
ing now Theorem 3.4, then there exists a function /^ G l /p ($) such 
that PAx/oo = PQx{/;}- Hence, f x = (P Ax)" 1 P Qx {/,'}• Conse- 
quently, 

II/-/00IU. < ll(PA x )- 1 ||op||PQx||op||K J (/)-/,'}||£nj) 
11 p 11 11 n 11 

, r op V{Xop||f / r\ /.'-i II 

< — y^ — \\{9x 3 U) - fj}\\tp(j)- 
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Now the conclusion of Theorem 3.5 follows by using (4.13). 



□ 
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Abstract. In this paper, we deal with higher order Boussinesq systems of 
equations in one dimension, that has been presented by Bona, Chen and Saut in 
[1]. We show that the solutions of these systems of equations with a nonlinear 
power a > 1 are global and decay in time for small initial data, and we show 
also that they blow-up in finite time. 

Keywords decay in time, Boussinesq equation, blow-up 

AMS Subject Classification: 35B40; 35Q10; 35Q20 

1. Introduction 
In this paper, consideration is given to the higher order Boussinesq system 

(1.1) r\ t - br] xxt + b 2 r\ xxxxt + u x + au xxx = -(/i(r?, u)) x + b 2 {f\{f], u)) xxx , 

(1.2) u t - du xxt + d 2 u xxxxt + i] x + ar] xxx = -(/ 2 (r?, u)) x + d 2 (f 2 (r],u)) xxx , 

with initial data rj(x, 0) = rio(x), u(x,0) = uq(x), and where the nonlincaritics 
fi(rj,u) = r] a u + cir/u a , f 2(11,11) = r/ a+1 + C2U a+1 , with a > 1 integer, c\,c 2 G M; 
the constants a, b, d, b 2 , d 2 G M. verify some conditions below, and x G M, t > 0.. 
The system (1.1)-(1.2) describes the propagation of surface water waves. Here, the 
independent variable, x, is proportional to distance in the directional of propagation 
while t is proportional to elapsed time. The quantity T](x, t) + ho corresponds to the 
total depth of the liquid at the point x at time t, where ho is the indisturb water 
depth. The variable u(x,t) reprents the horizontal velocity at the point (x,t). Our 
study here is devoted to the following particular cases for the system (1.1)-(1.2): 

CI = [b > 0, d > 0, 6 2 = 0, d 2 > 0, with a > : case Cl-1 or a = : case Cl-2] 

C2 = [b > 0, d > 0, b 2 > 0, d 2 = 0, with a > : case C2-1 or a = : case C2-2] 

C3 = [a = 0, b > 0, d > 0, b 2 > 0, d 2 > 0] 

C4 = [b > 0, d > 0, b 2 = d 2 = 0, with a > : case C4 - 1 or a = : case C4 - 2] 

( C5 - 1 : a ^ 0, b > 0, d > 0, b 2 = 0, d 2 > 0, or 
C5 = <^ 

[ C5 - 2 : a ^ 0, b > 0, d > 0, b 2 > 0, d 2 = 0. 

A local existence result has been obtained by Bona et all. in [1] for the full I.V.P 
(1.1)-(1.2) when a = 1 and a < 0, b > 0, d > 0, b 2 > 0, d 2 > 0, ci,c 2 G E. On 
the other hand, the case (C4-2) which corresponds to the purely BBM-Boussinesq 
and the case (C4-1) have been shown by the same authors to be locally well posed 
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in H S (R) s > ( see Bona et all. [1]). Global existence in H S (R) s > 1 has been 
proved in [1] for (l.l)-(l-2) under the condition a < 0, b = d > 0, b 2 = d 2 = with 
a = 1. Note that this condition is include in the case (C5). Furthermore, when 
studying the system (1.1)-(1.2) under the restriction (C4-1) with a — 1 and with 
complete or partial dissipation, Chen and Goubet showed in [2] that the solution 
of this system decays to zero when t goes to +oo. 

Our study here is concerned with the asymptotic behavior of the solution to the 
Boussinesq system (1.1)-(1.2). In the first part, we show that the solution of (1.1)- 
(1.2) under the restrictions (CI), (C2), (C3) or (C4), and with a > 5 (for the cases 
Cl-1, C2-1, C4-1) or a > 9, (for the cases Cl-2, C2-2, C3, C4-2), decays to zero 
when t goes to +oo. 

In the second part, we show that the solution of (1.1)-(1.2) under the restriction 
(C5) with a = —b= —d 2 and where a > 1, blows-up in a finite time. 

1.1. Notation. The notation || • || rp is used to denote the norm in L£(= H r,p ) such 
that if we set J r = (1 - J^ ) r / 2 , then for u G L£(R), ||w|| riP = ||u|| L? = \\J r u\\ hP < 
co. Also, | • \ p instead of || • ||o, p denotes the norm in L p , and M. s is used instead of L 2 . 
Throughout the paper, c represents a generic constant independent of t and x. The 
Fourier transform of a function / is denoted by /(£) or T(f)(£) and ^ r ^ 1 (/) = / 
denotes the inverse Fourier transform of /. 

2. Local Existence in time. 

In this section, we study the local existence in time for the solution to the Cauchy 
problem associated to (1.1)-(1.2) under the conditions CI, C2, C3, C4, or C5 above. 
We prove the following theorem: 

Theorem 2.1. Let rjo,uo G EP +1 (R), s > ^ real. Then there exists a positive 
constant Tq > and a unique solution (r),u) G C(0, To,H s (IR)) of the Cauchy 
problem associated with (1.1)- (1.2) under the conditions CI, C2, C3, C'4 or C5. 

Proof. The system of equations (1.1)-(1.2) can be written 

(2.1) U t = AU + DF(U) 

where 

V -a-^ + ^rHi + aJ^z! o 

and 
D = 



-(l-b^+b 2 ^)-\l-b 2 ^)l 




o -(i-d^ + d 2 ^)-Hi-d 2 ^)£ 



The operator A has a matrix valued symbol ^4(£), (£l, which is diagonalizablc, 
and we note A the diagonalized matrix symbol of A. Hence we can write A = 
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PA{P)~ 1 , where P = (pij), i,j = 1,2 is the matrix of the eigenvectors associated 
with the eigenvalues zAi and i\ 2 with 

A (£) = e(l-< 2 ) = _ A (e) 

Let 

S(t){<p u <p2) = ^"V'tei,^)) - (^- 1 (e I Vi),^ 1 (e I V 2 )) - (S 1 (t) Vl ,S 2 (t)^ 2 ), 

be the semi-group that occurs in the computation of the ordinary differential 
equation Ut — AU whenever it is done in the eigenvectors basis, with for j = 
1,2, Sj(t)tfj = j^J R e lx ^ +ltXj ^ifjd£ i . Then, after a few computations in which we 
diagonalize the matrix symbol of A and use the Duhamcl formula, we obtain the 
following integral form of the solution to the I.V.P. (1.1)-(1.2): 

(2.2) T)(x,t) = Pn * [Si (t) (in *V0 + h2 * U ) + S 2 (t)(hi *T]o + ll2 *«o)] 

+ / Pn * S 1 (t-r)(h 11 * fi{t],u) +hi2 * f 2 (r],u))dT 
J o 

+ / Pn * S 2 (t - r)(h 2l *f 1 (T),u) + h 22 * f 2 (r],u))dT, 
J o 

(2.3) u(x,t) = p 2 i * [Si (t) (in *?7o + ii2 *"o) - S 2 (i)(in *r] + l 12 *uo)] 

+ / P21 * Si(t-r)(/in */i(»y,w) +/112 * f 2 (i],u))dT 

J 

p 2 i * S 2 {t-T)(h 21 * fi(ri,u) + h 22 * f 2 (r),u))dr, 

o 

where 

~ 1 ~ 1 

Pll = Pl2 = , , ^„ , = , P21 = -P22 = 



Vi + w + w ^i + de+d 2 e 

ki = hi = \v^+be + b 2 T\ w 2 = h 2 = -\^i + de + d 2 £,\ 

~ ~ -ig(i + b 2 g 2 ) r ? _ -^(i + ^e 2 ) 

111 — ft 21 — , , ^„ , = , "42 — ^22 — 



2tJT+W+W " 2^1 + de + d 2 ^' 

To finish the proof of the theorem 2.1 and for the sequel we need the follow- 
ing inequality obtained thanks to the use of the Sobolcv embedding H S (R) C 

L°°(R), s > -, and thanks to the fact that H S (R), s > -, is an algebra : 

(2.4) \\fi(v,u)\\ s + \\f 2 (v,u)\\ s < C (h a M || s + || U Q r,|| s + h«+ 1 || s + || U «+ 1 || s ) 

< c(|^|-- 1 ||^|| a ||«|| i + |ti|Sr 1 ||^|| a ||«|| a 

+NST 1 NI 2 + NSr 1 NI 2 ) 
< c(|H|?||«|| a + |H|?|H| 8 + |H|? +1 + H? +1 ) 

and we need the inequalities in the following lemma : 

Lemma 2.2. Let a, b, d, b 2l d 2 in (1.1) satisfy the conditions (CI), (C2), (C3), (C4) 
or (C5). T/ien Jet V G H S+1 (K), s > §, Mi = 1 - b£, + b 2 ^, M 2 = 1 - 
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d-g^2 +(^2^4) and Pij , hij , lij \fi, j = 1,2 defined above. Then we have the following 
inequalities \/i,j = 1,2, 

(2.5) ||^(r)V|| s + \\Pij * ^(r)V|| s + ||p y * £,-(r)(fc« * V)IL< 

(2.6) l|5j(r)(^j * VOL + llPij * SjMfe * VOL 

(2.7) ll^(r)fe*V0L 

Proof. . Proof of the lemma 2.2 Thanks to the definitions of Pij,hij,lij and Mj 
above, we have for all i,j = 1,2 and for all a,b 1 d 1 b 2l d 2 satisfying the conditions 
(CI), (C2), (C3), (C4) or (C5), the inequalities 



< 


Ms 


< 


IIV'll.s+i 


< 


\\M^\\ 



(2.8) 


\Pij\ + \Pijhij\ < 


c 


(2.9) 


\hij\ + \Pijkj\ < 


c(l + £ 2 )* 


(2.10) 


\iui>\ = Ml = |Ai^(XiV)l < 


|>(r?| 


(2.11) 


I?i2^| = M\ = \k 2 T{M 2 ^)\ < 


|A^| 


where Ai(£) = 


- ,..* .. .. and X 2 (0 - _ ,. . i . , 


— . Then with the inequalities 



(2.8), (2.9), (2.10), (2.11) and thanks to the Planchcrel theorem and the definition 
of the norm W and of Sj, we are lead to the inequalities of the lemma 2.2. □ 

Now, consider the complete metric space 

F={(7 7 , U )e(C(0,T;H s (M))) 2 ,sup||r ? || s + sup|| U || s < M }, s > \, 

[0,T] [0,T] l 

where /i is a positive real constant. Then an application of the contraction-mapping- 
principle to F combined with the inequalities (2.4), (2.5), (2.6), (2.7), above, and an 
appropriate choice of T yields to the local existence result of the theorem 2.1. □ 

3. Linear Estimates 

The purpose of this section is to study the linear equation associated with the 
I.V.P (1.1)-(1.2) under the restrictions CI, C2, C3, or C4. We establish also linear 
estimates needed for the next section. For that we give some decay estimates and 
useful inequalities of the solution of the linearized system (1.1)- (1.2) via decay 
estimates of the semi-group S(t)(ipi,ip 2 ) — (Si(t)ipi, S2{t)ip 2 ). 
Consider the linear problem associated to (1.1)-(1.2): 

{Vt - Wxxt + hVxxxxt + u x + au xxx = x,t€l, 
Ut - du xxt + d 2 u xxxxt + r/ x + ar/ xxx = 
with initial data n(x, 0) = rjo(x), u(x, 0) = Uq(x). We prove the following theorem 

Theorem 3.1. LetT] (x), J 2 r]o(x), Mjf]o(x), u (x), J 2 u (x), MjU (x) e H 6 (IR)n 
L 1 (M) where for each j = 1,2, A4j is defined above in lemma 2.2. Then the solution 
(rj,u) of the linear problem (LP) satisfies 
(3.1) 

( c(l + i)-s, m the case CI -1,C2- 1,C4- 1; 
|jy(o;,*)|i,oo( R )+|u(a;,t)|L«>(R) < < 

[ c(l + *)-5, m the case CI - 2, C2 - 2, C3, CA - 2. 

for allt>0, x <G K, where c does not depend on x or t. 
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This theorem is a consequence of the following lemma : 

Lemma 3.2. Let for j = l,2,ifj, J 2 ip 3 , Mjifj G H 6 (M) nL'fR) where Mj are 
defined above in lemma 2.2, and let Sj(t) be as defined above as the components 
of the semi-group S(t) that occurs in the computation of the linear equation (LP). 
Then for each i,j = 1,2 with Pi,j,hij,lij defined above we have the estimates, 
Vt > 0, Vx g R, 

(3.2) \pij * Sj(t)ipj(x)\ L °° + \Sj(t)<Pj{x)\ L °° < c(l + t)~* Civile + |Vj|i) 

if a, b, d, b 2 , d 2 satisfy (CI - 1), (C2 - 1) or (C4 - 1), 

(3.3) \pij * S,(i)^(x)| L ~ + |Sj(t)^-(a:)|£oc < c(l + t)~'(lk>ill6 + l^li) 

if a, b, d, b 2 ,d 2 satisfy (CI - 2), (C2 - 2), (C3) or (C4 - 2), 

(3.4) \Pij * SjWhij * ip^L- <c(l + t)-'(||<ft|| 8 + |A>j|i) 

if a, b, d, b 2 , d 2 satisfy (CI - 1), (C2 - 1) or (C4 - 1), 

(3.5) \Pij * Sj{t){hi d * ipj)^ <c(l + t)-5(||^.|| 8+ |j2^|i) 

if a, b, d, b 2 ,d 2 satisfy (CI - 2), (C2 - 2), (C3) or (C4 - 2), 

(3.6) \p itj * Sj(t)(li,j * <Pj)\loo < c(l + t)-*(||Mi^-||6 + \M m \x) 

if a,b,d,b 2 ,d 2 satisfy (CI - 1), (C2 - 1) or (C4 - 1), 

(3.7) | Kj * Sj(t)(iij * ^)U~ < c(l + t) - '(||M^il|6 + \MjifjU) 

if a, b, d, b 2 , d 2 satisfy (CI - 2), (C2 - 2) (C3), or (C4 - 2), 

where c is independent ofifj,x 

In order to prove the lemma 3.2 we need the following proposition. 
Proposition 3.3. Given x G R and t G R+, consider the phases functions 

where Xj are defined above. Then under the conditions C1,C2,C3 or C4, ^-, j = 
1,2 /iai>e a finite number of stationary points. Moreover for each j = 1,2, there 
exists at least a stationary point £ s j of ipj which verifies 

(3.8) R(e sj ) > 
where 

R(y) = l-3ay-[bd+b 2 +d 2 +2a(b+d)]y 2 -[2(b 2 d+bd 2 )+a(bd+b 2 +d 2 )]y 3 -3b 2 d 2 y 4 +ab 2 d 2 y 5 

is variable whenever a, b,d,b 2 ,d 2 satisfy the conditions C1,C2,C3 or CA 

Proof. Proof of proposition 3.3. As given above we have (for the full system (1.1)- 
(1-2)) 

A,(0 = (-I)'" 1 ^^ 



va + K 2 + b 2 ^ 4 )(i + de + d 2 ^y 
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Then for j — 1,2, 

V>/(0 = ** A/(O + i _1 ^ = 

«• {1 - 3< 2 - N + &2 + d 2 + 2a(6 + d)]£ 4 - [2(6 2 d + 6d 2 ) + a(bd + b 2 + d 2 )]f 
-3b 2 d 2 £ 8 + ab 2 d 2 £ w } 2 - x 2 r 2 {\ + bf + fo 2 ? 4 ) 3 (l + d£ 2 + d 2 ^ 4 f = 
so that 

(3.9) i>/(0 = «• P(£ 2 ) = 

where 

P(y) = i?(y) 2 - x 2 t- 2 (l + 6y + 6 2 y 2 ) 3 (l + dy + d 2 y 2 f 
with 
P(y) = l-3ay-[bd+b 2 +d 2 +2a(b+d)]y 2 -[2(b 2 d+bd 2 )+a(bd+b 2 +d 2 )]y 3 -3b 2 d 2 y 4 +ab 2 d 2 y 5 . 

Therefore thanks to (3.9), the stationary points of "0 are such that their square are 
roots of P(y). Hence since P(y) is a polynomial of degree 12 so that it has at most 
12 roots, we deduce from (3.9) that each ipj, j = 1,2 has at most 24 stationary 
points in R. Let us prove now the second part of the proposition 3.3 (for all the 
cases Cl,C2,C3or C4). 

Proof for the case CI. 

If a, b, d, b 2 ,d 2 satisfy CI, then (3.9) is verified by 

P(y) = R(yf - x 2 r 2 {\ + byf{\ + dy + d 2 y 2 f 
where 

R(y) = l-3ay- (bd +d 2 + 2ab + 2ad)y 2 - (2bd 2 + abd + ad 2 )y 3 . 
We see that for fixed x and t large enough, 

(3.10) P(0) = 1 - x 2 r 2 > 0. 

Moreover, R(0) — 1 > and lim^ >oo-R(£ 2 ) = — oo so that since £ i — ► R(£, 2 ) is 

continuous, there exists at least a £o > such that R(£,q) = 0. Henceforth, with a 
such £ we have 

(3.H) P(£ 2 ) = -x 2 t- 2 {l + 6^) 3 (1 + d£l + d 2 e f. 

Therefore thanks to (3.11) and since b, d, d 2 are positives, we have 

(3.12) P(e ) < o. 

Hence since £ i — ► -P(£ 2 ) is continuous, we deduce from (3.10)-(3.12) that the equa- 
tion P(S, 2 ) — has at least one solution £ s G]0,£o[ where £o > is a root of R(£, 2 ). 
We deduce with (3.9) that for each j = 1,2, Vj has at least one stationary point 
£sj g]0,£o[- Moreover, since for each j = 1,2, -0j has a stationary point £ S j which 
verifies 

(3.13) < £„• < Co 

where £o is a root of R(£ 2 ), and hence since for all y > 0, -R(y) is a decreasing 
function, we get for each j = 1,2, thanks to (3.13) 

(3.i4) R(e sj )>R(e )=o 
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and the inequality (3.14) finishes the proof of the proposition 3.3 for the case CI. 

(Note here that if we had £o < £ S j < then we would have < ^ • < £q and 

consequently (3.14)). 

Proof for the case CI. 

The proof of the proposition 3.3 for the case C2 follows from that of the case CI 

by symetry. 

Proof for the cases C3 and C4. 

For the cases C3 and C4, we see that the functions P(y) and R(y) from the case 

C3 and that from the case C4 verify the same properties as that of the functions 

P(y) and R(y) in the proof for the case CI, particulary the properties (3.9), (3.10), 

(3.12), (3.14). Therefore, following the same lines of the proof for the case CI, we 

find the proof of the second part of the proposition 3.3 for the cases C3 and C4. 

This finishes the proof of the proposition 3.3. □ 

Let us prove now the lemma 3.2 

Proof. . Proof of the lemma 3.2 

If < t < 1, we have thanks to the definitions of Sj(t), j — 1,2, above in section 

2, and tpj in proposition 3.3, and thanks to the Schwartz inequality, we have 

(3.15) ISMnW = ^|/ R e^«)^(CR|< C (| R (l + e 2 )- 6 ^) 2 ||^l| 6 

< c||^|| 6 < c(l + i)~*||<£j 6 . 

If t > 1, let fl = {£ e R |£| < t^},m > 1, and q jt (£) = x^e 4 *^'; then thanks 
to the Schwartz and the Young inequalities, 

(3.16) toM^aOl = h { j n + I J eltXM)+lX ^ M)d ^ 

< c\qj t (x) * <pj(x)\oo 

+C {Lr {1+e)(>dC ) 2 {I J 1 + e ^t)\ 2 d^ 2 

< c\qjt(%)\oo\<Pj\i + ct~™\\<pj\\ 6 . 

It remains to estimate qjt(x). We need for the sequel the following notations: Let 
£ s = {( £ R, V'j(C) = 0) 3 — 1) 2} be the set of stationary points of i[)j. We know 
from the proposition 3.3 that £ s has a finite number of elements. Hence set for 
m > 1, 

M(U~™)= (J B((,t-^) \J{teR\t + (\<t-mj{teR\Z\<t-^} 

(e£ s (e£ s 

where for each ( e £ s , B((,t~™) = {£ e R |£ - C| < £~™}. Then we have 

(3.17) q j t{x) = {( 1 +[ 1 )e itx ^ +ix ^ = h + I 2 . 

J ftrW(C,t ™) J on{AA(C,t ™)} c 
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Since from proposition 3.3, card(£ s ) < oo, we get 

(3.18) \h\ < f i <%< J2 I , <*C 

J nrW(C,t~™) c&£s J s(c,t-s) 

Ce£a J {\i + C\<t-^} J {|||<t-m} 

For 72, we point out that on {A/'(C,£ - ™ : )} C , Vj, J — 1,2 have no stationary point 
so that we can integrate I 2 by parts as follows: 

(3.19) \h\ = \[ , e *^«)dei 

J sin{Af(c,t~™)} c 

= i 1 1 f 1 A (e lt ^ (i Ad£\ 



9{nn{jV(c,t-m)}c } \ M ^M)\ 



d ^M)\ 



■/ ^n{AT(c-,t--)}= l^"0i(€)| 2 
where for each j = 1,2, £ S j is a stationary point of ipj, which satisfies the property 
R{€sj) > in the proposition 3.3. 

Before continue, let us give the needed following inequalities useful for the proof of 
the lemma 3.2 : We claim that on Qr\{Af(^ S j, *~™ )} c > and for m > 3, if a, b, d, b 2 ,d 2 
satisfy (Cl-1), (C2-1) or (C4-1), then for each j = 1,2, 

ivaoi 2 



(3-20) hfj^ < 



(3-21) ^7^ < 



1 jV ;l I ci™ otherwise. 



and if a, b, d, b 2l d 2 satisfy (Cl-2), (C2-2), (C3) or (C4-2), then for each j = 1, 2, 

Proof. Proof of (3.20) and (3.21) . 

Thanks to ipj and Xj as defined above in proposition 3.3 and in section 2, we have : 

i>' j (t;) = \ j (0 + t- 1 x. 
We find (after obvious computations), 

where for the full system (1.1)-(1.2), 

R(y) = {l-3ay-(bd+b 2 +d 2 +2ab+2ad)y 2 -(2b 2 d+2bd 2 +abd+ab 2 +2ad 2 )y a -3b 2 d 2 y 4 +ab 2 d 2 y 5 } 

and 

Q(y) = (1 + by + b 2 y 2 )(l + dy + d 2 y 2 ). 
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Then, with £ S j S £ s chosen as in the proposition 3.3, we get 

(3-22) |^(0I = W'j&j) ~ ^(01 - \*'j(S.i) - A; (01 

where 



m&m 2 ) qQ(q 2 (£ 2 ) + Q(e)Q(e SJ ) + q 2 q 
^q^o + a/wj 

(3-23) +v/Q 3 (^)(^)-^ 2 ))}- 

The following relations are also useful for the proof of the inequalities (3.20), (3.21): 

(3.24) 161 + 161 = 16 + 61 if sgnfa) = sgnfa) 

(3.25) 161 + 161 - 16-61 if sgnfa) ± sgnfa) 

The proof of (3.24) and (3.25) is as follows : If sgn(£i) — sgnfa) then 

161 + 161 = 1161 + 1611 = k.gn(6)6 + * 5 n(6)6l = 16 + 61 

If sgn(£i) y^ sgn{S,2) that is if sgn(£i) = — sgnfa) then 

161 + 161 = 1161 + 1611 = |s<?n(6)6 + ^(6)61 = 16 - 61- 

We are now able to pull up the proof of the inequalities (3.20) and (3.21) for the 

different cases CI, C2, C3, C4 : 

Proof of (3.20) for the case C'l-1. 

In the case Cl-1 we have thanks to the definitions of R and Q above, 

^(6 2 J )-^(6) = (6-6 2 ,)[3a+(^+d2+2afe+2ad)(6+6 2 ,)+(26rf2+aci 2 + a M)(6 4 +66 2 ,+6 4 ,)] 
and 

q(6) - Qio = (£ 2 - oib +d+(bd+ d 2 )(e + e sj ) + bd 2 (e 4 + ee sj + o\ 

so that thanks to (3.23), 

(-1)^(6 -6) 
a j (6,)-a j (6 = l /,. u , w { 



^Q 3 (6 2 ,)Q 3 (6) 



-^— [6+d(bd+d 2 )(£ +6j) 



V / 3 (6 2 ,) + yo 3 TF) 

+ M 2 (6 4 + 64 + 4)] + y/Q 3 (e sj )[3a + (W + d 2 + 2afe + 2arf)(6 + £■) 
(3.26) + (2M 2 + ad 2 + a6rf)(6 4 + 66', + &•)]}■ 
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Then, since by proposition 3.3 R(£,gj) > 0, and since Q(£, 2 ) > V £ € R, we find 
with (3.22), (3.26), and the fact that a,b,d,b 2 ,d 2 > 0, 

l£ 2 -£ 2 -l 
1^(01 > -y=f{3a + (bd + d 2 + 2ab + 2ad)£ 2 + (2bd 2 + ad 2 + abd)^} 

c|£ 2 -£ 2 l 
(3 27) > — ^ 

" (i + be)Hi+de+d 2 ^)i' 

On the other hand, we find here with obvious computations, 

(l + b^ 2 )2(l + dC 2 + d 2 ^ 4 )2 
Then with (3.27), (3.28), we find 

,, 9Q , 1^(01 . c|e|(i + 6g 2 )l(i + rf€ 2 + rf 2 e 4 )* 

Hence if |£| > 1 we have on ft n {-/Vfej.* - ™)} thanks to (3.29), 
(«0) J$g < *' B 



so that when sgn(^) = sgn(£ S j) we get on SI n {A/"(^ S j, t _ ™ )} c , thanks to (3.30) 
and (3.24), 

1^(01 < c|C| 6 



l^(OI 2 " K-^TOI + M 2 

(3 - 31) -if-^™ 2 = in^- ct "" ~- c< 

and likewise when sgn(£) ^ sgn(!; S j) we find with (3.30) and (3.25), 
Hence (3.31) and (3.32) give for |£| > 1 and on SI n {W^tT ™)} c , 

ffl— ■ 

On the other hand, if |£| < 1 then thanks to (3.29) and proceeding as in (3.31) and 
(3.32) we find on St n {A/"(6j,*"-)} c , 

This finishes the proof of the inequality (3.20) for the case (Cl-1). 
The proof of (3.20) for the cases (C2-1) and (C4-1) and that of (3.21) for the cases 
(Cl-2), (C2-2), C3 and (C4-2), follows the same lines as that of the proof of (3.20) 
for the case Cl-1, using the R(y) and the Q(y) that correspond to these cases. This 
finishes the proof of the inequalities (3.20) and (3.21). □ 
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Now, with the inequality (3.20) in hands and thanks to (3.19), we find for the 
cases (Cl-1), (C2-1), and (C4-1), 

(3.35) \h\<ct- l {( t^d£_+ [ t&dO^ct-^. 

J {I«I<1} J {1<I?I<«5^} 

Likewise for the cases (Cl-2), (C2-2), C3 and (C4-2), we find in the same ways 
thanks to the inequalities (3.19) and (3.21) 

(3.36) |/ 2 |<cr^. 

Therefore, taking m = 4, we find for the cases (Cl-1), (C2-1), (C4-1), thanks to 
(3.35), (3.18), (3.17), and (3.16), Vi > 1, 

(3.37) \S {t) V] {x)\ < cT *(ta-|i + H^-He) < c(l + t)-'(|^-|i + ll^lle) 

and taking m — 8, we find for the cases (Cl-2), (C2-2), C3 or (C4-2), thanks to 
(3.36), (3.18), (3.17), and (3.16),, 

(3.38) |Sj(t)^(a;)|<c(l + t)-5(|^|i + ||^||6) Vt > 1. 

(3.37), (3.38), with (3.15) give apart of the inequalities (3.2) and (3.3) in the lemma 

3.2. 

In order to finish the proof of the lemma 3.2, we need the following lemmas which 

are also useful for the sequel. 

Lemma 3.4. Let h be such that (1 — JW)M£) € L 2 (R). Then we have h{x) G 

l}(R)c\L 2 (R). 

Lemma 3.5. Let for each i,j = 1,2, pij be defined as above in section 2. Then 
for a,b,d,b2,d2 satisfying CI, C2, C3 or C4, we have p^ G L (R) n L (R). 

Proof. Proof of lemma 3.4 and lemma 3.5. We have thanks to the Planchcrcl 
theorem, 

f (l + x 2 ) 2 \h(x)\ 2 dx - j \T{{l + x 2 )h{x)){i)\ 2 di 

JR JR 

/a2 
l(l-^2)M0| 2 ^<C<0O. 

This shows that h G L 2 (R). Moreover, with (3.39) and thanks to the Schwartz 
inequality, we get 



(3.40) / \h(x)\dx< ( / (l + x 2 )- 2 dx)?( / (1 + x 2 ) 2 \h(x)\ 2 dx)^ < c < oo 

^R JR ^R 

that is h G L (R) and the lemma 3.4 is proven. Now, thanks to the lemma 3.4 the 
definition oipij in the section 2, the proof of the lemma 3.5 follows immediately □ 

We are now able to finish the proof of the lemma 3.2. We begin by endding the 
proof of the inequalities (3.2) and (3.3) : Since from lemma 3.5 p^ G L X (R) i, j — 
1,2, and thanks to the estimates of Sj(t), i,j — 1,2, in (3.2) and the Young 
inequality, we find for the cases Cl-1, C2-1, C4-1, 

\Pij * Sj (*) <Pj 0) I oo < \Pij 1 1 1 Sj (t) <Pj {X) | oo 

(3.41) < c(l + t)-5(|^| 1 + ||^.|| 6 ) Vi>0, 
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which finishes the proof of the inequality (3.2) . We finish the proof of the inequality 
(3.3) in the same manner. To prove the others inequalities (3.4), (3.5), (3.6), (3.7) 
of the lemma 3.2, we use the definitions oipij,hij,lij i.j = 1,2, in the section 2, 
and we proceed as above and as in the proof of the lemma 2.2. This ends the proof 
of the lemma 3.2. □ 

Proof. Proof of the theorem 3.1. Writing the solution (77, u) of (LP) in its integral 
form as follows, 

(3.42)77(a;,i) = P11 * [S'i(t)(/ii *77o + /i2 * u ) + S 2 (t)(ln * 770 + ^12 *«o)] 

(3.43) u(x, t) = P21 * [Si(t)(hi * i]o + I12 * uo) - Sz(t)(lii * T]o + I12 * tio)]- 

the proof of the theorem 3.1 follows immediately from the inequalities of the lemma 
3.2. □ 

4. Global existence and decay for the solution of the NL system. 

Let us state now the results of the global existence and decay properties of the 
solution of the nonlinear system (1.1). 

Theorem 4.1. Let a > 5 and let r]o(x) , J 2 i]o(x), A4jT)q(x), Uo(x), J 2 uo(x), A4jUo(x) £ 
H 7 (R) n L X (K) where Mi = 1 - b£, + b 2 -§^, M 2 = 1 - d£? + d 2 -§^. Suppose 
that, for each j = 1,2, 

\Mjr) (x)\i + \MjU (x)\i + ||A^j77o(ic)|| 6 + WM^x)^ < S, 

5 sufficiently small. Then if a > 5, the solution (r],u) of the Cauchy problem 
associated to (1.1)- (1.2) with a,b,d,b 2 ,d 2 satisfying the conditions (Cl-1), (C2-1) 
or (C4--1) verifies 

(4.1) |»?(»,*)|oo + Ka;,*)|oo < c(l + t)~K Vt>0, 

(4.2) \\n(x,t)\\ s +\\u(x 7 t)\\ 8 < c. 

Otherwise if a > 9, then the solution (n, u) of the Cauchy problem associated to 
(1.1)-(1.2) with a,b,d,b 2 ,d 2 satisfying the conditions (Cl-2), (C2-2), (C3) or (C4- 

2) verifies 

(4.3) \n(x,t)\ 00 + \u(x,t)\ 00 < c(l + t)~i, Vt>0, 

(4.4) lh(a;,*)l|8+||«(a;,t)||8 < c. 

Proof. In addition with the inequality (2.4) and those of the lemma 2.2, we need 
for the proof of the theorem 4.1, the following inequalities: From the definition of 
/1, f 2 given in (1.1)-(1.2), we find thanks to the Holder inequalities, 

(4.5) |J 2 (/i(r7^))|i< C (HSo" 1 ||«ll2h|| 2 + | W | co |HSr 2 ||r ? ||2) 
and 

(4.6) \J 2 (f2(v,u))\ 1 <c\u\^- 1 \\u\\l + c\ V \^ 1 H\l 
Now define the quantity 

Q(t)= sup {(l + r)'(|»?(r)|oo + |«(r)|oo) + IH|8 + N|8} 

0<T<t 

We will consider here the integral form of the nonlinear solution of (1.1)-(1.2) as 
given by (2.2)-(2.3) in the section 2 above. Therefore, taking the L°° norm of the 
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solution (rj, u) written in its integral form, and thanks to the inequalities in the 
lemma 3.2 of section 3, and the inequalities (4.5), (4.6), (2.4), we find for each 
j = 1,2„ if a,b lC ,d,b2,d 2 satisfy (Cl-1), (C2-1), or (C4-1), 

(4.7) |»?(»,*)|oo + |«(*, t)U < c(l + t)-i(\M 3 i] Q (x)\ 1 + \\M :i 'no(x)\\e 

+ \MjUo(x)\i + \\MjUo{x)\\e) 

rt 

\Ot + l 



+ cQ{t) a ^ (i + (t- r ))-*(l + r)-^-dT. 
J o 

But since for a > 5, 

(1 + (t-T)) _ l(l+T) _5 T k dT < C(l+t)~i, 

o 

we deduce from (4.7) that for a > 5, when (1.1)- (1.2) is under the conditions (Cl-1), 
(C2-1), or (C4-1), 

(4.8) (l + T)i(|»y(T)|oo + KT)|oo) < c(\MM x )\i + \\MM x )h 

+ \MjU (x)\i + \\MjUo(x)\\ 6 
+ Q(t) a+1 ). 

Furthermore, when (1.1)-(1.2) is under the conditions (Cl-1), (C2-1), (C4-1), we 
get for a > 5 thanks to (2.4) and the inequalities of the lemma in section 2 above, 

(4.9) ||»?(x,*)||8 + ||u(ar,*)||8 < c{||»jo(a:)||9 + ||«o(a:)||9 

+Q(t) a+1 f (l + ry^dr} 
J o 

< c{\\ m (x)\\ 9 + \\u (x)\\ 9 + Q(t) a+1 }. 

Then, thanks to (4.8), (4.9) and the definition of Q(t) above, we are lead for a > 5, 
to the inequality 

(4.10) Q(t) < cllMjtioli + lMjUoh + WMjTioh + WMjUoWfi 

+ WMjUoWe + hollo + ||uo||g + Q{t) a+1 }- 
Henceforth, thanks to the inequality (4.10), we find that, if 

(4.11) \Mjtjo\i + \MjUq\i + WMtfoh + \\M 3 u \\e + Wmh + \\uoh < s 

with 6 small enough, then Q(t) is bounded. Indeed, the inequality (4.10) is satisfied 
if Q(t) £ [0,/?i] U [/?2,oo[ with < (i\ < j3 2 < oo (since S is small). Thereby, since 
by the definition of Q(t), Q(0) = \i]o\oo + l u o|oo + ll^olls + ll u oll8i we have thanks 
to the Sobolev embedding, H 8 (i?) C L°°(i?) and with (4.11), 

(4.12) Q(0)<c(||»to|| 8 +||uo||8)<c& 

Then, the continuity of Q(t) and the inequality (4.12) allow us to conclude that Q(t) 
remains bounded for 8 small enough and for all t > 0; otherwise, (4.12) would be 
contradicted. Thus we have obtained for a > 5, a bound of Q{t) and consequently, a 
bound and a decay estimate of the local solution which permit us to extend globally, 
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for a > 5 and for small initial values, the local solution of the system (1.1)-(1.2) 
under the conditions (Cl-1), (C2-1) or (C4-1). On the other hand, following the 
same lines above, we find for a > 9, the same resultas above for the solution of 
(1.1)-(1.2) under the conditions (Cl-2), (C2-2), (C3) or (C4-2). This finishes the 
proof of the theorem theorem 4.1 □ 

5. Blow up in finite time for the solution of the NL system. 

We consider here the system (1.1)-(1.2) under the condition C5-1 with f\ = 
and a = —b = —di < 0. That is 

(5.1) rit - brj xxt = -u x + bu xxx i,(el, 

(5.2) u t - du xxt + bu xxxxt = -r) x + br) xxx - (f-2(r),u)) x + b(k(r),u)) xxx 

with T](x, 0) = ri (x), u(x, 0) = ito(x), 77(00, t) = n x (oo, t) = r) xx (oo, t) = u(oo, t) = 
u x (oo,t) = u xx (oo,t) = and where, f2{Vi u ) — u a+1 or / 2 (i),h) = rj a+1 or 

h{-n, u ) = ur]+ -nf, a> 1. 

Then, setting Pi = 1 - b^ and P 2 = 1 - dj^ + &J^, (5.1)-(5.2) becomes 

(5.3) 774 = -u x i,(el. 

(5.4) u* = -P^Pifa + M^u))*- 

We already know from the theorem 2.1 above the local existence of the solution 
of (5.3)-(5.4). We will prove now a blow up theorem of the solution to (5.3)-(5.4). 
Note that all the theorems in the sequel, work also, thanks to the symetry, for the 
solutions of (1.1)-(1.2) under the condions (C5-2). 
Before giving the blow-up theorems, let us prove the following needed lemmas. 

Lemma 5.1. If there exist functions uq(x) G II s , u>o(x) € EF , s > — , 
such that the initial values r](x,0), r]t(x,0), u(x, 0), satisfy the relations 
n(x, 0) = (w (x)) x , f] t (x, 0) = -(u (x)) x , 

then for allt € [0,T], the solution (f],u) of the Cauchy problem associated to (5.3)- 

(5.4) satisfies 

n(x,t) = (w(x,t)) x , 

with a corresponding evolution ofw(x,t), rj(x,t) satisfying the system 

(5.5) Wt(x,t) = —u(x,t) x, i € R, 

(5.6) u t = -P^ 1 P 1 ( V + f 2 (r,,u)) x . 

The couple of functions (w,u) belongs to C^QO.T]; IF +2 ) x ^([O, T]; H s+1 ) 
Proof. From (5.3) we find 

(5.7) T)(x, t) = T)(x, 0) - / (u(x,s)) x ds. 

Jo 

The term rj(x,0) is an x— derivative by hypothesis and L u x ds is an x— derivative. 
Therefore, there exists a w(x, t) such that r](x,t) = w x . The second part of the 
lemma is proved by Theorem 2.1 above. □ 

We need also, for the result of blow-up, the following lemma proved in (Levine, 
1974). 
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Lemma 5.2. Suppose if){t), t > 0, is a positive, twice- differentiable fonction ip 
sastifying the inequality 

V' / V-(i + 7)(V>') 2 >0 

where 7 > 0. 

If i/j(0) > an d "0'(O) > 0; then ip(t) ► 00 as t — ► t\ < — ; (t\ is a positive 

constant). 

Let us give now the blow-up theorem. 

Theorem 5.3. (Blow-up for the solution of (5.3)-(5.4))- 
Suppose that there exists 7 > such that 



(5.8) r 1 (r 1 + f 2 (r ] ,u))<2(l + 2 1 )F 2 (7 1 ,u) 

where F 2 is such that 

BF *—■ 

— = 77 + A / 2 (m, v)+^2 X i u3 

~g- = MV + f2 (»?,«)) 

wii/i t'o > 0, Xj > 0, Vj, fc > anrf where A = 1 i/ / 2 is woi a purely function of 
u. 

Suppose also that the initial values r/(x,0), r]t(x,0), u(x, 0), are chosen such that 
they satisfy 

• rj(x,0) = (w (x)) x , T] t (x,0) = -(u (x)) x , for some u (x) E IF +1 , w (x) E 
^+ 2 , s>\, 

• E(0) = l -{u ,P^ l P 2 u a ) + j R F 2 { m ,u a )dx < 0. 
Then, the solution (r/,u) of (5.3)-(5.4) blows-up infinite time. 

Proof. Proof of the theorem 5.3. For example: If f 2 (j],u) = u a+l then F 2 (rj 1 u) — 

-rj 2 + w] -\ -u a+2 , or if f 2 {j], u) = r/ a+1 then F 2 (rj, u) = -rj 2 -\ ;:?7 Q+2 , or 

2 a + 2 ' 2 a + 2 

if f 2 (r\, u) = -rj 2 + w] then F 2 (r\, u) = -rj 2 + wq + -r/u 2 + -rfu + -r? 3 . 

Zi Zi Zi Zi \J 

To begin the proof, let 



E(t) = l -{u,P^P 2 u)+ f F 2 ( V ,u)da 



We claim that 



(5.9) E(t) = E(0) = l -{u^P^P 2 u a ) + I F 2 ( Vo ,u )dx < 0. 
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Indeed, thanks to (5.3), (5.4) and integrations by parts, we have 



f BP BP 

(5.10)£'(t) = (u t ,P^P 2 u) + / {,, t _i (,,,„) + Ut _ ±( v ,u)}da 

J R drj du 

f dF 2 
BP 

P 2 lp i(V + f2(V, u)) x -g-(V, u ) dx 

k 



,u)- u x (r) + X f 2 (u 7 i]) + y^ XjU 3 )d,3. 

jR .7=1 



-is / P 2 - L Pi(v + .f2(v,u)Uv + f2(v,u))dx = 0. 
Jr. 



Now, define ip(t) as follows. 



V>(t) = (w, P^P 2 w) + p {t + to) 2 



where (3q and to are non-negatives constants to be defined later. Suppose that the 
maximal time of existence is infinite. A contradiction will be obtained by lemma 
5.2. The condition necessary to apply the lemma 5.2 is that tp"ip— (l + j)(tp') 2 > 0. 
We have by hypothesises of theorem 5.3, and thanks to the lemma 5.1 and integra- 
tion by parts, 



V>'(i) = 2(w t , Pr 1 P2w) + 2/? (t + t ) = 2(-«, P^P 2 w) + 2/3 (t + t ), 



and 



V>"(t) = 2(u,P^ 1 P 2 u)+2(P 2 - 1 P 1 (r 1 + f 2 (r 1 ,u)) x ,P^ 1 P 2 w) + 2f3 
= 2(u, P^P 2 u) - 2(t? + / 2 (77, u),ri) + 2/3 

= 2{u 1 P^ 1 P 2 u)-2 [ [r 1 (r 1 + f 2 (? 1 ,u))-(2 + 4 1 )F 2 (ri 7 u)}dx 
Jr 

-2(2 + 4j)E{t) + (2 + 47)(«, Pf^u) + 2/3 
> 4(1 + 7 )( U ,P- 1 P 2U )- 2(2 + 4 7 )£(t) + 2/3 . 
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It follows that, since ip(t) > 0, and using the Schwartz inequality and the Parseval 

formula, 

^'V-(l + 7 )(^') 2 > [4(l+7)KPr 1 P 2 «)-2(2 + 4 7 )S(t) + 2/3o]V(t) 

-4(1 + 7 )[(-«, P^P 2 w) + (3 (t + t )} 2 

> [4(1 + 7 )(«, P^P 2 u) - 2(2 + Arj)E(t) + 2/? ]^(*) 
-4(1 + 7)[(«, P^P 2 u)(w 7 P^P 2 w) 
+2(u,p- 1 P 2 u)Hw,p- 1 P 2 w)ip (t + t Q )} 

-4(l + 7 )/3 2 (t + t ) 2 

> [4(1 + 7 )(«, P^P 2 u) - 2(2 + Arf)E{t) + 2/3r#(i) 
-4(1 + 7 )[(«, P^P 2 u){ Wl P^P 2 w) 
+2(u,P^ 1 P 2 ufi(w,P^ 1 P 2 w)if3 (t + t Q )} 

-4(1 + j)M{t) + 4(1 + 7 )A>K P^P 2 w) 

> -2(1 + 2 1 )[2E(t) + /3o]V(«) + 4(1 + 7)(«, P^P 2 u)f3 (t + t a f 
-8(1 + 1 ){u 1 P^ 1 P 2 u)^{w,P^ 1 P 2 w)^f3o{t + t ) 

+4(l + 7 )/3o(^,Pr 1 P 2 w) 

> -2(l + 2 7 )[2E(t) + p }m 

+4(1 + 7 )(«, P^ 1 P 2 u)f3 Q (t + t ) 2 - 4(1 + 7 )(«, P^P 2 u)fo(t + t ) 2 
-4(1 + 7 )A)(w, Pf 1 P 2 w) + 4(1 + 7)A,(w, Pf^ttf) 

> -2(1 + 27)[2E(t) + ##(*)■ 

Thereby, since £(i) = £(0) < 0, it follows by taking j3 = -2£(0), that ip"ip - 
(1 + 7)(^') 2 > 0- Also ^'(0) = 2(-uo,P^ 1 P 2 w ) + 2(3 Q t > if t is sufficently 
large. Thus, by lemma 5.2, ip{t) becomes infinite at a time T\ at most equal to 



t&o = — 7 < °°- Therefore, we have a contradiction with the fact that the 



7^'(0) 

maximal time of existence is infinite. Henceforth, there is blow up in finite time, 
and the maximal time of existence is finite. □ 
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Abstract 



Here is introduced the concept of Caputo fractional radial derivative for a 
function denned on a spherical shell. Using polar coordinates we are able to 
derive multivariate Opial type inequalities over a spherical shell of M. N , N > 2, 
by studying the topic in all possibilities. Our results involve one, two or more 
functions. We produce many univariate Caputo fractional Opial type inequali- 
ties several of these used to establish results on the shell. We give application 
to prove uniqueness of solution of a general partial differential equation on the 
shell. Also we apply our results for Ricmann-Liouville fractional derivatives. 
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1 Introduction 

This work is inspired by articles of Opial [22], Bessack [13], and Anastassiou- 
Koliha-Pecaric [11], [12], and Anastassiou [9], [10]. We would like to mention 

Theorem A. (Opial [22], 1960) Let c > and y (x) be real, continuously 
differentiable on [0, c] , with y (0) = y (c) = 0. Then 

c f c 2 

y(x)y'(x)\dx<- {y (x)) dx. 

4 Jo 

Equality holds for the function 

y (x) =ion [0, c/2] 



42 ANASTASSIOU: FRACTIONAL INEQUALITIES 



y (x) = c — x on [c/2, c] . 

The next result implies Theorem A and is very useful to applications. Also 
it is our main motivation. 

Theorem B. (Bessack [13], 1962) Let b > 0. If y (x) is real, continuously 
differentiable on [0,6] , and y (0) = 0,then 

b f b 2 

\y(x)y (x)\dx < - / (y'O)) dx. 
o z Jo 

Equality holds only for y ~ mx, where m is a constant. 

Opial type inequalities usually find applications in establishing uniqueness 
of solution of initial value problems for differential equations and their systems, 
see Willett [27] . In this article we present a series of various Caputo fractional 
multivariate Opial type inequalities over spherical shells. To achieve our goal 
we use polar coordinates, and we introduce and use the Caputo fractional radial 
derivative. We work on the spherical shell, and not on the ball, because a radial 
derivative can not be defined at zero. So, we reduce the problem to a univariate 
one. 

Therefore we derive and use a large array of univariate Opial type inequalities 
involving Caputo fractional derivatives; these are Caputo fractional derivatives 
defined at arbitrary anchor point o e K. In our results we involve one, two, or 
several functions. But first we need to develop an extensive background in two 
parts, then follow the main results. 

At the end we give application proving uniqueness of solution for a gen- 
eral PDE initial value problem. Also we reestablish our results by involving 
Riemann-Liouville fractional derivatives defined at an arbitrary anchor point. 

In this article to build our background regarding Caputo fractional derivative 
we use the excellent monograph [17]. 

Caputo derivative was introduced in 1967, see [14], also see [15], [16]. 

It happens that the Riemann-Liouville fractional derivative has some dis- 
advantages when is to model real-world phenomena with fractional differential 
equations. One reason is that the initial conditions there involve fractional 
derivatives that are difficult to connect with actual data, etc. However Caputo 
fractional derivative modelling involves initial conditions that are described by 
ordinary derivatives, much easier to write out of real world data. So more and 
more in recent years the Caputo version is usually preferred when physical mod- 
els are described, because the physical interpretation of the prescribed data is 
clear, and therefore it is in general easier possible to gather these data, e.g. 
by appropriate measurements. Also from the pure mathematics side there are 
reasons to prefer recently more and more the Caputo fractional derivative. 
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2 Background-I 

Here we follow [17]. 
We start with 
Definition 1. Let v > 0, the operator J%, defined on L\ [a, b] by 

J v J{x):=^ 7) J\x-t) v - 1 f{t)dt (1) 

for a < x < b, is called the Riemann-Liouvillc fractional integral operator of 
order v. 

For v — 0, we set J° := /, the identity operator. Here L stands for the 
gamma function. 

Theorem 2. ([17]) Let / g £i[a,6], v > 0. Then, the integral J%f{x) 
exists for almost every x G [a, b] . 

Moreover, J v a f G L 1 ([a, b}) . 

We need 

Theorem 3. ([17]) Let m,n>0,$eli ([a, b]) . 

Then 

j™js$ - j: i+n $ (2) 

holds almost everywhere on [a,b] . Lf additionally $ G C([a, 6]) or m + n > 1, 
i/ien i/ie identity holds everywhere on [a, 6] . 

We give 

Definition 4. ([17]) Let ^ G R + and m — \v~\ , [•] is the ceiling of number. 
The operator D v al defined by 

D v J:=D m J™-"f, D :=-£-, (3) 

ax 

is called the Riemann-Liouvillc fractional differential operator of order v. 

For v = 0, we set D° := /, the identity operator. 

If v G N then D v a f = f^ v \ the ordinary v order derivative. 

Next we give 

Definition 5. (p. 37, [17]) Let v > and n := \v~\ , a € K. Then, we define 
the operator 

D V J := J:'" &\ (4) 

whenever /(") G Li ([a, b\) . 

Also we need 

Theorem 6. (p.37, [17]) Let v > 0, n := [z/| . Moreover assume that 
/ G AC™ ([a, 6]) (space of functions with absolutely continuous (n — 1) —st deriv- 
ative). Then 



Kf = K (/ - T n -i (/; a)) , a.e. on [a, b] , (5) 

where 

T n _ 1 (f;a)(x):=J2^l}^(x-a) k , xe[a,b], (6) 



fc! 

k=0 
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is the Taylor polynomial of degree n — 1 of f, centered at a. 

Next we give the definition of Caputo fractional derivative ([17]). 

Definition 7. (p.38, [17]) Assume that / is such that D v a (/ - T„_i (/; a)) (x) 
exists for sonic x G [a, b] . Then we define the Caputo fractional derivative by 

D»J (x) := Dl (/ - T n _a (/; a)) (x) . (7) 

So the above definition applies to all points x £ [a, o] : D v a (/ — T„_i (/; a)) (x) € 
R. 

We have 

Corollary 8. Lei v > 0, n := [z/] , / e AC™ ([a, 6]). Then the Caputo 
fractional derivative 

Waf (X) = r , 1 , f X (X t)"""- 1 / (n) (t) dt (8) 

r (n - 1/) y a 

exists almost everywhere for x in [a, b] . 

We have 

Corollary 9. Let v > 0, n := \u\ , / e AC™ ([a, 6]) . ITien, D»J exists iff 
D v a f exists. 

Proof. By linearity of D v a operator and assumption. □ 
We need 

Lemma 10. ([17]) Let v > 0, n = \v~\ . Assume that / is such that both 
DIJ and D v a f exist. 
Then, 

DU (x) = Dlf (x) J2 r( C^l u (x a) k - . (9) 



k=0 



T{k-v+l) 



Lemma 11. ([17]) All as in Lemma 10. 

Additionally assume that f^ (a) = for k = 0, 1, . . . , n — 1. Then, 

DU = Dlf. (10) 

In conclusion 

Corollary 12. Let v > 0, n := |"i/| , / G AC™ ([a, b}) , D£ a / existe or D V J 
exists, and f^ (a) = 0, k = 0, 1, . . . , n — 1. TTien 

^/ - £*«/■ (11) 

We need the following Taylor-Caputo formula 

Theorem 13. (p.40, [17]) Let v>Q,n:= \v\ , / e AC™ ([a, 6]) . Tften 

fe=0 

V.x e [a, 6] . 
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Clearly here J^D^J E AC n ([a, b]) . 

Corollary 14. Let v > 0, n := \v\ , f € AC m ([a, b}) , and /W (a) = 0, 
k = 0, 1, . . . , n — 1. Then 

f (x) = r a DU (x) = ^- J\ x - if- 1 DU (i) dt. (13) 

We need 

Lemma 15. Let v > 7+1, 7 > 0. Call n :— \v\ , m := [7] . Then n — m > 1, 
i.e. m < n — 1. 

Proof. Clearly f > 1 and z^ > 7, v — 7 > 1. By 7 + 1 > m we get v > m, 
and n > m, that is f — m > and n — m > 0. 

We see that v > 7 + I > [7] + !, where [•] is the integral part. Thus 
v > ([7] + 1) £ N and v > [v] > [7] + 1. 

Therefore 

M - H > 1, (14) 

to be used next. 

We distinguish the following cases. 

i) Let 1/, 7 £ N, then [z/| = [j,] + 1, |" 7 ] = [ 7 ] + 1. By (14) we get ([v] + 1) - 
([7] + 1) > 1. Hence n — m > 1. 

ii) Let z/, 7 <G N, then [z/] = [z/| = 1/, [7] = [7] = 7. So by (14) n- m> 1. 

iii) Let v ^ N, 7 e N. Then n = [z/| = [z^] + 1, m = [7] = [7] = 7. Hence by 
(14) we have (\v~\ — 1) — m > 1, and [V| — m > 2 > 1. Hence n — m > 1. 

iv) Let z/ e N, 7 £ N. Then 1 + 7 < [7] + 1 = [7 + 1] , and 1 + 7 < v £ N 
by assumption. 

Therefore [7] + 1 < u, and v — [7] > 1. So that again n — m > 1. 

Claim is proved in all cases. □ 

We present the representation theorem. 

Theorem 16. Let f>7+l,7>0. Call n := \v\ , m := [7] . Assume f £ 
AC m ([a, b}), such that / (fe) (a) = 0, k = 0, 1, . . . ,n - 1, and D»j€ L^^b). 

Then 

D2J £ C ([a, b}) , D2J (x) = J?~ V (m) (*) , (15) 

azirf 



mj w = - — r / (x - tr^- 1 d:„/ (*) *, (i6) 

r {v - 7) Va 



V.x £ [a, 6] 



Proof. If 7 = then (16) collapses to (13) , also (15) is clear. So we assume 
7 > 0. By Lemma 15 we have m < n — 1. By the assumption / <G ^4C™ ([a, b}) 
we get that / £ C"" 1 ([a, 6]) and thus / £ C m ([a, 6]) . 

By Lemma 3.7, p.41 of [17] we get that L>7„/ = J™-rf(™) e (7 ([a, 6]) and 
-D*a/ (a) = 0, for 7 ^ N. Clearly the last statement is true also when 7 £ N, so 
proving (15) and first claim. 
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To remind, we have v — m > 0, and v — l>0by7>0. Using T (p+ 1) = 
pT (p) , p > 0, (13) and Theorem 7 of [9], we obtain 

f {m) (x) = r a - m D: a f(x), \/x<=[a,b}. (17) 

Therefore we get 

djj{x) = j™-v (m) (x) ( = 7) j^-^r a - m Dij{ X ) 

(by Theorem 2.2, p.14 of [17], and v - 7 > 1) 

= r a - 1 D:j{x), Vx£[a,b}. 

That is proving (16) . □ 

We also give the representation theorem. 

Theorem 17. Let z/ > 7 + 1, 7 > 0. Ca// n := \v\ , m := [7] . Let f G 

AC™ ([a, 6]), smc/i t/iat /^ (ft) = 0, fc = 0,1,..., n — 1. Assume there exists 
Daf (x) e M, Vx e [a, 6] , and £>£/ G L^ (a, 6) . Then 

Dlf G C ([a, b\) , DJ/ (x) = j^-7/M (a; ) , (i 8 ) 

Vx <G [a, 6] , 

^/ (x) = jrr^y £ (* - ty-~<- 1 Dlf (t) dt, (19) 

Vx G [a, 6] . 

Proof. By Corollaries 9 and 12 we get existing D" a f (x) € M, and that 
£>*a/ («) = D a/ 0) > Vx e K & ] ■ That is £>*<,/ e Loo (a, 6) and by (16) we have 

DU (x) = 1 f (x - t)"- 7 - 1 ^/ (t) dt, Vx G [a, 6] . (20) 

r (^ - 7) J a 

Since D 7 a / G C([a,b]) we get D 7 a /(x) G R, Vx G [a, 6] . And since / G 
C m ([a, 6]) then / G AC" 1 ([a, 6]) . By Corollary 9 L> 7 / exists. Also /W (a) = 0, 
for fc = 0, 1, . . . , m — 1. Thus by Corollary 12 we obtain DJf (x) = DJ a f (x) , 
Vx G [ft, 6] . Now by (20) we have established (19) . □ 

3 Main Results 

3.1 Results involving one function 

We present 

Theorem 18. Let v > 7+1,7 > 0. Call n := \v\ and assume f G 
AC m ([a, b}) such that /( fe ) (a) = 0, fc = 0, 1, . . . , n - 1, and D»J G Loo (a, b) . 

Let p, q > 1 such that — I — = 1, a < x < b. 
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The 



\D:j(u)\\(DU)^)\doJ< 



(x — a) p 



(</2)T{ V - 1 ){{p V - P1 - P +l){pv- P1 -p + 2)) 1/p 

IDUWfckv) \ (21) 



Proof. Similar to Theorem 25.2, p.545, [2], and Theorem 2.1 of [11]. □ 

A related extreme case comes next. 

Proposition 19. All as in Theorem 18, but with p = 1 and q = oo, we find 



|I>7 o /(u0l|I>: o /( W )|du;< 

(*-«rV,,n. ,1, \ 2 



i>- 7 + 2) V l|£> * a/Il ~.(«.-)J • (22) 

Proof. Similar to Proposition 25.1, p. 547, [2]. □ 

The converse of (21) follows. 

Theorem 20. Let v > 7+1,7 > 0. Call n := \v\ and assume f <G 
AC m ([a, b]) such that f {k) {a) = 0, k = 0,1,..., n- 1, and D»J, — ^ € 
Loo ( a : &) ■ Suppose that D" a f is of fixed sign a.e. in [a,b] . Let p,q such that 
<p <1, q<0 and - + - = 1, a< x <b. Then 

|I>: o /(a»)||I>r o /( W )|du;> 

pv—py—p+2 

(x — a) p 
{0)T{ v - 1 ){{p V -p 1 -p+l){p V -p 1 -p + 2)) 1/p 

IDUWfckv) '. (23) 



Proof. Similar to Theorem 25.3, p.547, [2], and Theorem 2.3 of [11]. □ 

We give 

Theorem 21. Let v > 2, k > 0, v > k + 2. Call n := \v\ and assume 
f G AC n ([a, b]) such that /W (a) = 0, j = 0, 1, . . . , n-1, and D»J e L x (a, 6) . 
Lei p, q > 1 smc/i i/iai - + - = 1, a < x < b. Then 

|^a/M||^a +1 /M|^< 
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2(pi;-pfc-p + l) 

{x — a) p 



2(v(v- k)y {pv-pk- P + iy 



ID'^fU^ckv) \ (24) 



Proof. Similar to Theorem 25.4, p.549, [2], and Theorem 2.4 of [11]. □ 
The extreme case follows. 

Proposition 22. Under the assumptions of Theorem 21 when p = 1, q = oo 
we find 

' X \D , lJ{uj)\\D 1 lt 1 f{uj)\duj< 

(x - afC- k U\\DU\L, (a , x) Y 

^ 2 ' ' . 25 

2(I>-fc+l)) 2 

Proof. Similar to Proposition 25.2, p. 551, [2]. D 

We give the related converse result. 

Theorem 23. Let v > 2, k > 0, v > k + 2. Call n := \v\ . Assume 
f G AC m ([a, b}) such that fW (a) = 0, j = 0, 1, . . . ,n - 1, and D»J, ^j e 
Leo (a,b) . Suppose that D" a f is of fixed sign a.e. in [a,b] . Let p,q such that 
<p < 1, q < and - + - = 1, a< x < b. Then 

|2>* /(w)||l£+7M|dw> 

2(py-pfc-p + l) 

(a; — a) p 

2 (r [y - k)f {pv-pk-p+ l) 2/p 

\D:j(w)\ q dw) \ (26) 



Proof. Similar to Theorem 25.5, p. 553 of [2]. □ 

Next we present 

Theorem 24. Let 7, > 0, v > 1, v — 7j > 1; i = 1, . . . , I, n := \v~\ , and 
f e AC n ([a, 6]) suc/i t/iat f {k) (a) = 0, fc = 0, 1, . . . , n-1, and £>£ a / e Loo (a, 6) • 
Here a < x < b; q\ (x) , 92 (#) continuous functions on [a, b] such that q\ [x) > 0, 

92 (a;) > on [a, b] , and r^ > : X^j=i r i = r - Let si, s'i > 1 : 1 — r = 1 and 

s 2 , So > 1 : 7- + -A- = 1, and p > s 2 . 

52 »2 

Denote by 

Qi (*) := f / (<?i M) 8 ' 1 duA ' (27) 
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and 



Q 2 (x):=(J\q 2 (u;)r s i /p ck?j \ (28) 

a := E^». (29) 

ps 2 



Then 



■ I 

?i («) J] l^a/ (w) P dc < Qx (x) Q 2 (x) 

t-t r <T riCT 

n( ( r^- 7l) r(z,- 7l -i+ar 

(a; _ a) (EL 1 (-7 I -iK + - + ^) 



( (Z)»=l (^ - 7* - 1) r i s l) + rs l cr + 1 



1/si 



g2 Hl^r a /Hl p ^) rP . (30) 



Proof. Similar to Theorem 26.1, p. 567, [2], and Theorem 2.1 of [12]. □ 

The counterpart of last theorem follows. 

Theorem 25. Let 7, > 0, v > 1, v — 7$ > 1; i = 1, . . . , I, n := \v~\ , and 
f G AC n ([a, b}) such that /W (a) = 0, fc = 0, 1, . . . ,n - 1, and D£ /, gfy e 
Loo ( a : &) ■ ffere a < a; < 6; gi (a;) , q 2 ( x ) > continuous functions on [a, b] and 
Ti > : 5Z»=i r * = r - Let < Si, S2 < 1 ond s' l7 s' 2 < such that — + 4- = 1, 
1 — r = 1. Assume that D" a f (t) is of fixed sign a.e. in [a, b] . Denote 

Qi (x) := (J (<Zi M) 5 ' 1 dej) * , (31) 

g 2 (*) := ( I (q 2 (cj)y s ' 2 dcj) ' . (32) 



Set 



The 



SlS 2 

sis 2 - r 



(33) 



£ gi H(ni£:-/Hrj^> 
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Q\ (x) Qi (x) 



nL |(r (y - 7i)) ri {{y - 7,-1) ^i + i) ('^) 

( x _ fl ){(EU r 4 ((«/- 7 i-l)«i+sj 2 ))+l}/«i 



{(ELn((^-7i-i)si + s 2 - 2 )) + i} 

^HI^:a/HI AS2 ^) • tfu 



1/si 

"/AS2 



Proof. Similar to Theorem 26.2, p.570, [2], and Theorem 2.3 of [12]. □ 

A related extreme case comes next for p = 1 and q = oo. 

Theorem 26. Lei f > 7, + 1, 7$ > 0; i = 1, . . . , I. Call n := \v\ and assume 

f e AC" ([a, 6]) sucft i/iat / (fc) (a) = 0, fc = 0, 1, . . .,n-l, and D£ a / e L^ (a, 6) . 

//ere a < x < b, with < q (u>) € Loo (a, &) owd r* > : X)i=i r « = r - Then 



q{u)J{\DZf{uT^< 

Halloo, (a,x) ( ll-^*a/Hoo,(a,x) 



7 — ^ 7^ > • (35) 

Proof. Similar to Proposition 26.1 of [2], p. 573 and Theorem 2.2 of [12]. 
□ 

We continue with the interesting 

Theorem 27. Let k > 0, 7 > 1, v > 2, n := \v~\ , such that v — 7 > 1, 
7 - & > 1, and f e AC™ ([a, 6]) s«c/i tfiai / (j) (a) = 0, j = 0, 1, . . . ,n - 1, and 
D* a f G Loo (a, 6) • tfere a < x < b, p,q > 1 : - + - = 1. Then 



l^a/Hl|^a/H|^< 

2- 1 /P(x-a)( 2 ^ fc -^ 1+ f) 

r (1/ - fc) r (1/ - 7 + 1) ((1/ - 7) « + i) 1/q 



(2isq-kq- lq -q + 2) 1/q ' 
Proof. Similar to Theorem 26.3, p.574, [2], and Theorem 2.5 of [12]. □ 



(36) 
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We give 

Theorem 28. Let v > 7* + 1, 7* > 0, i = 1, . . . , k e N - {1} , n := \v\ . 

Assume f G AC™ ([a, 6]) such that /<>') (a) = 0, j = 0, 1, . . . ,n - 1, and D»J G 
Loo (a, b) . Here a < x < b, 7 := X^=i7i- ^ e ^ P><7 > 1 swc/i i/iai - + - = 1. 
Furthermore, suppose that \D" a f (t)\ is decreasing on [a,x]. Then 

k 



n\DZf(uj)\dLo< 

i=l 

/ 1 + fcfp — 7P \ 

p{x — ap p 1 
IL=i r (^ - 7») ( fciy P - IP - k P + ! 

(ri^a/wi fcg rf*) 1/g 

(fci^p — 7p — kp + p + 1) 



,1/p 



(37) 



Proof. Similar to Theorem 26.6, p. 581 of [2], and Theorem 2.6 of [12]. □ 

The extreme case follows 

Theorem 29. All as in Theorem 28, but p = 1, q = 00. Then 



IJl^i/Hldw 



< 



i=l 



(*-o)**'- 7+1 (ll^:./lloo,(-,-))* 

(38) 



(nti r (1/ - 7i)) (^ - 7 - A + 1) {kv - 7 + 1) ' 
Proof. Similar to Proposition 26.4, p. 582 of [2], and Theorem 2.7 of [12] 



□ 



3.2 Results involving two functions 

We present 

Theorem 30. Let v > 7, + 1, 7, > 0, i = 1,2, n := |"i/| , anrf /1, /2 G 
AC™ ([a, 6]) sucft tfiai /^ (a) = / 2 (j) (a) = 0, 3 = 0, 1, . . . ,n - 1, a < x < b. 
Consider also pit) > and q{t) > 0, wii/i aZZ p(t), -ttt, g (£) G L^ (a, 6) . 
Further assume D" a fi G Loo («, 6) , i = 1, 2. 

Lei A y > and A a , A^ > suc/i i/iai A„ < p, where p > 1. Set 

P k (cj):= (cj-t) p-i (p (t) ) (^T) dt) (39) 
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k= 1,2, a < x < b; 



A(w) 



>H(p 1 M) a °( £ ^)(p 2 H) a ^ £ ^)(pH)- a - /p 
(i>- 7l )) A «(i>- 72 ))^ 



A (x):= / (A(u))*^du 



and 



Si 



2 1 "(^^), i/Aa + A„<p, 
1, i/ A Q + A, y >p. 



If \p — 0, we obtain that 



q(u) \DZf 1 (u J )\ X -\D:j 1 (o J )\ K + 



(40) 
(41) 
(42) 



\DZf 2 (u;)\ x »\D:j 2 (u;)\ K 



(A (x) \ Xfi =o) 



K 



dui < 



A Q + A„ 

pHiic/iHr+ic^Hr^ 



{ha ^ ) 



(43) 



Proof. Similar to Theorem 2 of [3] and Theorem 4 of [9]. □ 
It follows the counterpart of the last theorem. 
Theorem 31. All here as in Theorem 30. 
Denote 

2^/^-1, z/A^>A„ 

1, if A/? < K- 



If X a = 0, then 



q(cj) \D:if 2 (cj)\^\DUiH\ K + 



(44) 



\D2lh(cu)\^\DU2^)\ K 



(Ao (x) | Ao=0 ) 2^ /X A 



A/3 + A„ 



do; < 

Wp 



rK/p 



(45) 



pH[|^ a .AHI p + |^ a / 2 Hn^ 



all a < x < b. 
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Proof. Similar to Theorem 3 of [3] and Theorem 5 of [9]. □ 
The complete case A Q , Xp ^ follows. 
Theorem 32. All here as in Theorem 30. 
Denote 



7i 



and 



Then 



02 



;= J 2l *» ) -1, if X a + X >X v , 
{ 1, «/ A Q + Xp < X v , 



1, z/ A Q + A/9 + A„ >p, 

* J, i/ A Q + A^ + A^ <p. 



(46) 



(47) 



g( W ) |^/i H| A ° |^/ 2 (u;)^ !£>:„/! H| A " 



|DS/l Ml^ l^a/2 Ml"* l^a/2 Ml'" 



dw < 



(48) 



A {x) 



X„ 



K/v 



(X a + X p ) (X a + Xp + x v ) 

p{w){\D: a h{w)\* + \D:j 2 {w)\ p )du 



X^ /P J2 + 2 P ~^ (7iA^) A " /p ' 



aZZ a < x < b. 

Proof. As Theorem 4 of [3], and Theorem 6 of [9]. □ 

We continue with the special important case 

Theorem 33. Let v > 71 + 2, 71 > 0, n := \v\ and f u f 2 € AC n ([a, b\) 
such that fi (a) — f 2 (a) = 0, j — 0,1,..., n— 1, a < x < b. Consider 
also pit) > and q(t) > 0, with all p(t) , ~m, q(t) € £00 («, &) . Furthermore 
assume D% a fi £ L^ (a, b) , i = 1, 2. 

.Lei A a > 0, < A Q+ i < 1, and p > 1. Denote 



0a := 



2WW-1, »/A a >A a+ i, 
1) */ A Q < A a+ i, 



L(z) 



(g(w))V 1 - A «+Wdw 



(l-A a+ i) 



C3^a+1 

A a + A a+1 



and 



P 1 {x):=\ (x-t) (J ^ }Ji (p(t))- mp - 1) dt, 



T(x) :=L(x) 



PiW 



( p-l \ \ (^a+^a + l) 



r(i/-7!) 



Wl 



. = 2 (^)(A a +A a+1 ) 



(49) 

(50) 

(51) 

(52) 
(53) 
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and 
Then 



$0) :=T(x)u>i. 



«M l^/iMI Ao |^ +1 /2Mr o+1 + 



(54) 



|D£/ 2 MI Ao |D£ +1 /i(w)r +1 



do; < 



$(z) 



pMdD^AMrH- 



1^/2 mpw~ 



(55) 



all a < x < b. 



Proof. As in Theorem 5 of [3], and Theorem 8 of [9]. □ 
We give 

Corollary 34. All here as in Theorem 30, with \@ = 0, p(t) = q(t) = 1. 
Then 



\D^ a f 1 ( ( j)\ Xa \D^ a f 1 (u)\ K + 



\D2lf 2 (u>)\ X "\D:j 2 (w)\ X » 



du < 



(56) 



d (*) (J (|£>r o /i (w)| p + |^ / 2 M| p ) dcj 
all a < x < b, where 

A~ 



(^) 



C 1 (x):=(A (x)\x^)(j^^j " P 5 U 



5i:= 

We find that 

(A) (&) U f ,=o) = 



2 1 -(^^), i/A a + A„<p, 
1, z/ A Q + A, y >p. 



(P-1)C S J 

,(r(i/-7i)) A ° {vp-'yip- ir~ 

.(Aa^p- A a 7ip- A a +p- A„)^ p V J 

' ^a vp— A a -yip — A a +p — X u \ 



(57) 
(58) 



(x-a)( : 



(59) 



Proof. As Corollary 1 of [3], and Corollary 10 of [9]. □ 
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Corollary 35. (All as in Theorem 30, \p = , p (t) = q (t) = 1 , X a = A„ = 
l,p=2.) 

In detail: Let v > 7l + 1, 71 > 0, n := \v\ , /1, f 2 e AC n ([a, b}) : f[ j) (a) = 
/ 2 W) (o) = 0, i = 0,1,. . . ,n - 1, o < x < 6, D^/j € ioo (a, 6) , t = 1,2. TAera 



[|(^/ 1 )H||(^a.A)HI + 



|(^/ 2 )H||(^ a / 2 )M|]^< 

' {x-a)^-^ * 

2r (1/ - 7T) y^^TTv^T^^^T 



(p: a / 1 )H) 2 + (pr a / 2 )H) 2 ^ 



aZZ a < x < b. 

Corollary 36. (to Theorem 31; X a = 0, p(t) = q(t) = 1.) 

"""i^/ 2 mi a "|i>: /i(w)i a, ' + 



|DK/iM| A " \TKahWr 



du> < 



all a < x < b, where 



C 2 (x):=(Ao(x)\ Xot =o)2^- 



K 



As + A y 



A„/p 



rA.,/p 



FFe find that 

(A (x) \ Xa =o) = 



1, «/ A/3 < A„ 



(p-l)l 



(i>- 72 ))^(^-7 2 p-i)V - 
(p-a,)( e ^) \1 



XXpvp- \p-f2P- Xp+p- X V ) K p >, 



(x-ay 



(60) 



(61) 



(62) 
(63) 



(64) 
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Corollary 37. (to Theorem 31; X a = 0, p (t) = q(t) = 1, A/3 = \ v = 1, p = 2.) 
In detail: 

Let v > 72 +l, 72 > 0, n := \v~\ , /i, f 2 € AC" ([a, 6]) : f[ j) (a) = f® (a) = 0, 
j = 0, 1, . . . , n - 1, a < x < b, D»J % e L x (a, b) , i = 1, 2. 

TTien 



[|^/ 2 Hll^ a /iMI + 

|i>2S/iMiii>r /2(w)i]dw< 

' (x-a) (i/ ' 72) ' 

\/2r (i/ - 72 ) 7^^/2^-272-^ 

(L\Vx (a;)) 2 + (D- / 2 M) 2 ) do; 



(65) 



aZZ a < x < b. 

We continue with related results regarding IHI^ . 

Theorem 38. Let v > 7, + 1, 7; > 0, i = 1, 2, n := |"i/| and /1, / 2 <G 
AC" ([a, 6]) sucft tfiat /^ (a) = / 2 (j) (a) = 0, j = 0, 1, . . . ,n - 1; a < x < b. 
Consider p(x) > and p(x) e L 00 (a,b) 1 and assume D" a fi <G L OQ {a,b) , 
i = 1,2. Lei A a , A/3,Aj, > 0. 5ei 



T(x) 



(a; — a) 



(^A„— 7iA a +^A^ — feA^+l) 



Then 



(v\ a - 71 A Q + z/A/3 - 72A/3 + 1) 

l|p( S )lloo,(a,x) 

(r(^-7i + i)) A "(r(^- 72 + i))^' 



p(w) \D2lh Ml'" I^S/2 H| A ' J \DUi n\ K 



\Diih MI A " I^S/2 HI Aa |^a/2 MI A " 



da; < 



T(z) 



7-)" f || 2 (A a +A„) 
l i/ *a/l|loo,(o,x) 



\DUi 



,2X 

loo,(a,x) 



(66) 



l^a/2 



2A fl 



:{ a > x ) 



\D: a h 



,2(A a +A„) 

loo,(a,a:) 



(67) 



all a < x < b. 

Proof. Similar to Theorem 7 of [3], and Theorem 18 of [9]. □ 

We give 

Corollary 39. (to Theorem 38) Let v > 71 + 2, 71 > 0, n := |"i/| , /1, 

/ 2 G AC" ([a, 6]) suc/i tfiat /^ (a) - / 2 0) (a) = 0, j = 0, 1, . . . , n - 1; D^ a fi & 
Leo (a, b) , i = 1, 2. TTien 



ANASTASSIOU: FRACTIONAL INEQUALITIES 



57 



[|0£/i(w)||i>£ +1 /2M| + 

DZ +1 fi(u)\\D^ a f 2 (w)\]du 
{x - af {u ^ l] 



< 



2(r>- 7l + i)) 2 

\ D *ah\\oo,(a,x) + ll- D *o/2|loo,(o,x) 



(68) 



all a < x < b. 

Next we give converse results involving two functions. 

Theorem 40. Let 7, > 0, 1 < v - jj < i, < p < 1, j = 1,2; n := [i/| , 

a™d fi, h e AC™ ([a, 6]) s«c/i tfwrf /^ (a) = / 2 W (a) = 0, / = 0, 1, . . . ,n - 1, 
a < x < 6. Consider also pit) > and g (t) > 0, wzi/i aZZ p(t), -4y, ?(£), 
-Try € £00 (fl> 6) • Further assume D^ a fi E L^ (a, 6) , i = 1,2; eac/i of which is 
of fixed sign a.e. on [a, 6] . Let A„ > and X a , Xp > suc/i i/iai A^ > p. 
i/ere P& (w) , A (w) , Aq (x) are as in (39) , (40) , (41) , respectively. 

Set 

5i:=2l _ ( ^)_ 



(69) 



If \p — 0, then 



q(u) \DZh H| A « i^a H| A " + 



|i>2i/2MI Ao l^a/2MI 
(A) (») k,=o) 



*1 



A Q + A„ 



(^) 



(70) 



Proof. Similar to Theorem 5 of [8] and Theorem 4 of [7]. □ 

We continue with 

Theorem 41. All here as in Theorem J^0. Further assume Xp > X v . Denote 



If X a = 0, then 



5 3 :=(5 2 -l)2- (x »/ x »K 



«h i^/2Mi A/, ii)r„/iMr+ 



(71) 
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\DZlh{uo)\^\D:j 2 {u)\ K 



du > 



(A (x) \ Xa=0 ) 2^ 



°3 



A/3 + A„ 



p{u)[\D: a h{u)\ p + \D:j 2 {u)\ p ]du 



V+A^ 



all a < x < b. 



(72) 



Proof. Similar to Theorem 6 of [8], and Theorem 5 of [7]. □ 

We give 

Theorem 42. Let v > 2 and 71 > such that 2<v — ^i<-,0<p<l, 

n := \v\. Consider f t , f 2 G AC n ([a, b}) such that f[ l) (a) = / 2 (0 (a) = 0, 
Z = 0, 1, . . . , n — 1, a < x < b. Assume that D^ a fi G L^ (a, 6) , i = 1,2; eacft 0/ 
which is of fixed sign a.e. on [a,b] . Consider also p(t) > and q(t) > 0, with 
all p(t), tffi, q(t), ^ty <E ioo (a, &) • £et A Q > A Q+ i > 1. Denote 



3 .- ^"(WW) _ A 2 -A Q /A Q+1; 



(73) 



L (x) as in (50) , P\ (x) as in (51) , T (x) as m (52) , u)i as in (53) , and $ 
as in (54) . TTien 



«H \DZf 1 (^\DZ + \f 2 (co)\ X - +1 + 



|^/ 2 M| A °|^ +1 /iM| 



dw > 



$(x) 
all a < x < b. 



pHd^Hr + ID^HDA; 



•^o+Vhl' 



(74) 



Proof. Similar to Theorem 7 of [8], and Theorem 7 of [7]. □ 

We have 

Corollary 43. (to Theorem 40; X fj = 0, p(t) = q(t) = 1). Then 



\DZlh{u)\ X ~\D: a h{u)\ K + 



\DZf 2 (oj)\ X «\D:j 2 (u)\ K 



du> > 



ci(x)[ (ii>r /iMi p +ii>r /2Mndu; 



(^) 



(75) 
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all a < x < b, whe 



d (x) := (A (x) | A , =0 ) 



K/p 



Si, 



^ VJ \X a + X 

Here (A (x) \\ =o) is given by (59) . 

Corollary 44. (to Theorem 41; A Q = 0, p(t) = q(t) = 1, Xp > A„). Then 



(76) 

(77) 



|D^/ 2 H| A ^|^ a /iM| V + 



\DZlh{ut* \D:j 2 {w)\ K 



du > 



c 2 (x){l [|i>:;,.AM|" + |ir ( ,/ 2 Ml"] ^ 

all a < x < b, where 



'A1/+A/3- 



C 2 (x):=(A ( 2 ;)U Q= o)2 — 
i/ere (A (x) |a„=o) «s flwen & 2/ (64) . 



A " \ tA,/p 

"3 



A/3 + Ay 



(78) 



3.3 Results involving several functions 



We present 

Theorem 45. Here all notations, terms and assumptions are as in Theorem 
30, but for fj e AC" ([a, 6]) , wi£/i j = 1, . . . , M € N. Instead of 5i there, we 
define here 



si 



M 1 ^^^), tfX a + X v 
2(~ p~ J" 1 , i/A a + A„ 



Call 



If Xfj = 0, i/ien 



pi (a) := (A (x) | A/3=0 ) 



A,. 



A Q + A y 



>P- 



Wp 



jT%( W ) f J2\ D *^ M^ l^a/i Ml*" I **> 



< #j>i (a;) 



(79) 



(80) 



(81) 
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all a < x < b. 

Proof. As in Theorem 2 of [4], and Theorem 4 of [10]. □ 

We continue with 

Theorem 46. All here as in Theorem 45. 

Denote 

2^/^-1, if\ (3 >K, 

1, if A/? < A„, 



£2 := 



M 



1, if X v + Xp>p, 

y " ) if\„ + \ <p, 



and 



<p 2 {x) :=(A {x)\ Xa =o)2^ 



(^) 



A,. 



A/3 + A y 



Wp 



r-Wp 



If X a = 0, i/ien 



M-l 



jf%( W ) I | £ [l^/i+lMl^l^a/iMI^ 

|^/,MI A n^ a / J+ iMI A "]} + 

|^.fMMI^|^ a /iHI^ + 
I^AHI v PI a /MHI A "]}^ 

P(w) 
da; 



<2l » )e 2 ip 2 {x) 



M 



J'=l 



all a < x < b. 



Proof. As Theorem 3 of [4], and Theorem 5 of [10]. □ 

We give the general case 

Theorem 47. All as in Theorem 45. 

Denote 

._ J 2l *- ) -1, t/Aa + A^A,,, 

1, «/ A Q + A/3 < A^, 



7i 



and 



02 



' A a +J,j + J„ 



if K + X p + X v > p, 
, z/ X a + Xp + X v < p. 



(82) 
(83) 

(84) 



(85) 



(86) 



(87) 
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Set 



ip 3 (x) ■- A (x) 



K 



(A« + \[j) (A Q + \ fi + \ v ) 

A c f72 + 2( E ^)(7iA /3 )^ 



and 



The 



£3 



M 



1, 



«/ A Q + A/3 + \ v > p, 
if A Q + A/3 + A„ < p. 



?H 



M-l 



2 [ITO/,-) MI Aq |(£>22/i + i) MI A ^ K^/i) HI A " 



i=i 



+ |(D^/i) MI A ' ITO/i+i) MI Aq K^/i+i) MI A " 



IWi/o M| A ° |(^/ M ) M| a ^ |(^a/i) MI A " 



+ \(D:ih) (c)i A ^ K^/ M ) (c)i A ° \(d:j m ) (c)| a - 



cia; 



' A +A s +A„ 



<2V v )e 3 y 3 {x)-\ / p (a,) 



' A a + A fl +A„ ' 



M 



X)l(^a/i)M| P 
J'=l 



dw 



all a < x < b. 



(88) 
(89) 



(90) 



Proof. As Theorem 4 of [4], and Theorem 6 of [10]. □ 

We give 

Theorem 48. All here as in Theorem 33, but for fj e AC n ([a, b]) , j 



l,...,MeN. 
Also put 



£4 



1, 

' A a + A 



M 



j/ \ a + \ a+1 > p, 
if X a + \ a+1 < p. 



(91) 



The 



M-l 



?H E [l(^/i)MI A<, |(^ +1 /i + i)M| A<>+1 



i=i 
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+ \(D? a f j+1 ) (co)\^ \(DZ +1 f,) M| Aq+1 ] } 

+ [\(DZf 1 )(u;)\ X <'\(DZ +1 f M )(u,)\ Xa+1 
+ \(DZf M ) (lu)\^ | (2>£ +1 /i) H| A « +1 ] } do, 

p{u) 



< 2^ 



M 



>£A<b(x) 



J2\(DU 3 )^)\ p ]dco 



all a < x < b. 



(92) 



Proof. As Theorem 5 of [4], and Theorem 7 of [10]. D 
We continue with 

Corollary 49. (to Theorem 45, A^ = 0, p(t) = q(t) = 1, X a = X v = 1, 
p = 2). In detail: 



Let v > 7l + 1, 7l > 0, n := [v] , f 3 G AC n ([o,6]) , j = 1, 
a < x <b, and D" a fj <G £oo (a,b) , j = 1, . . . , M. 

Fere /j° (a) = 0, Z = 0, 1, . . . , n - 1; j = 1, . . . , M. 

Then 



,M € N: 



I 2r (i/ - 71) y/V^-^'lv - 2 7l - 1 i ' 



M 



E(^a/i(«)) ! 



da; 



(93) 



oZZ a < x < 6. 

Corollary 50. (to Theorem 46, X a = 0, p(t) = q(t) = l,Xp = X u = 1, 
p = 2). In detail: 

Let v > 72 + 1, 72 > 0, n := |Vj , /j G AC" ([a, 6]) , £>£„/,- G L^ (a, 6) , 
j = 1,...,MgN; a< x< b. 



Hersfj l) (a)=0,l = 0,l,...,n-l;j = l, 

Then 



,M. 



M-l 
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< 



+ \(DJlfj)(u;)\mj J+ i)(w)\}} 

+[\(D2if M )(uj)\mji)(u)\ 

+ \(D? a f 1 )(w)\\(IK a f M )(u,)\]}d U 

T (is - 72) yjv - 72\/ 2jy - 272 - 1 



M 






duo 



(94) 



all a < x < b. 

Corollary 51. (to Theorem 48, X a = 1, A Q+ i = 1/2, p = 3/2, p(t) = 
q (t) ~ I). In detail: 

Let v > 71 + 2, 71 > 0, n := |"i/| , and /,- e AC" n ([a, &]) , j = 1, . . . , M e N, 
such that /• (a) = 0, I = 0, 1, . . . ,n — 1, a < x < 6. Assume also D" a fj G 
L 00 (a,6),j = l,...,Af. 

Set 

2 \ (x-a)^ 5 ^ 

.^-37i-2j (I>- 7l )) 3/2 ' 
all a < x < b. Then 



$*{x) 



(95) 



M-l 

E 



K^/,)H|J|(^ +1 /, + i)h 



+ |(D2i/,- + i) (w)| ^|(z??i +1 /i)M 

+ k^/m)hi^/|(^ +1 /i)h 



dw 



< 2<T (x) 



£(Eip^)mi 3/2 )^ 



(96) 



aZZ a < x < b. 

We continue with results regarding 1 1 • 1 1 . 

Theorem 52. AM as in Theorem 38 but for /, e AC" ([a, i»]),j = l,...,Me 
N. Then 
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M-l 



P (w) 1 1 J2 [\(DZfj) mi a = \(Dz.f J+1 ) (wt* mjj) mi ; 



3 = 1 



+ \(DTJ 3 ) (lo)\^ \(DZf j+1 ) M| A " |(D: o / i+1 ) n\ K ] } 

+ [|(^A) (cu)\ x - \{D2if M ) ( W )| A " !(£>:„/!) M| A ^ 

+ |(D^/i) M| A ^ |(£^/ M ) M| a " |(^ /m) M| A "] } dc 



(97) 



3 = 1 



all a < x < b. 



Proof. Based on Theorem 38. □ 

We give 

Corollary 53. (to Theorem 52) In detail: 

Let v > 7i + 2, 7i > 0, n := \u\ and fj e AC" ([a, b]) , j = 1, . . . , M € N, 
such that / (a) = 0, I = 0, 1, . . . , n — 1; j = 1, . . . , M; a < x < 6. Further 
suppose that -D* a /j € Loo (a, 6) , J ' = 1, • • • , M. Then 

r x ( (M-l 

J a E [l(^/,)HI|(^ +1 / J+ i)H| 

+ |(^ +1 /,)H|l(^/, + i)MI]} 

+ [|(^/i) Ml |(d^ + 7m)M| 

+ |(^ +1 /i)H||(^/m)HI]}^ 

x2(i/- 7 i) \ / M 



< 



(x — a)* 



(r(i/-7! + i)r 






Jlloo 



(98) 



a/Z a < x < 6. 

We continue with converse results. 

Theorem 54. Lei jj > 0, 1 < v — 7, < -, < p < 1, j = 1, 2; n := \v\ , and 

fi G AC" ([a, 6]) , i = 1, . . . , M e N, such that ff> (a) = 0, I = 0, 1, . . . ,n - 1; 
i = 1, . . . , M; a < x < b. Consider also p(t) > and q (t) > 0, wit/i all p (t) , 
J^y, q (t) , J^y € Loo (a, 6) . Further assume D^ a f t e L x (a, 6) , i = 1, . . . , M; 
eac/i of which is of fixed sign a.e. on [a, b] . Let \ v > and A Q , A^ > swc/i that 
K > P- 
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Here Pk (u>) , k = 1, 2, A (u>) , Aq (x) are as in (39) , (40) , (41) , respectively. 



Call 



ip! (x) := (A (x) | A/3 =o) 



A, 



A Q + X v 

1 Ma + V t 

51 := M 1 ^—^) . 



K/p 



If Ap — 0, then 



J\{u>) If^Wafj Ml*" \ D *Ji Ml*" I du; 



> 5\ip\ (x) 
all a < x < b. 



J%(cj)lf2\DUjn\ P )du; 



( ^±^ } 



Proof. As Theorem 11 of [7], and Theorem 11 of [8]. □ 

We continue with 

Theorem 55. All as in Theorem 54- Assume Xp > A„. Denote 



cp 2 (x):=(A (x)\ Xa=0 )2( P -^) 



where 63 is as in (71) . If X a — 0, then 



M-l 



A„ 



A/3 + A„ 



Wp 



rWP 
J 3 



q(u)ll J2 [\(DZf J+ i)^)\ X0 \(D:jj)^)\ K 



j=i 



+ i(^/ J )Hi^ip: a / J+1 )Hi A "]} 

+ [\(D:if M )(cj)\^\(DUi)H\ K 
\{D? a h) (cj)\^\(DUm) M| A -] } dc > 



M 



'**+*£ \ ^ A^+Ag N 



'2^ - ^(^H / p(w) 



M 



J2\(d:j j )(w)\ p 
3=1 



dio 



all a < x < b. 

Proof. As Theorem 12 of [7], and Theorem 12 of [8]. □ 



(99) 
(100) 

(101) 



(102) 



(103) 
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We give 

Theorem 56. Let v > 2 and 71 > such that 2 <v — 7i<-,0<p<l, 

n := \v\ . Consider f % G AC n ([a, b]) , i = 1, . . . , M € N, such that ff ) (a) = 0, 
I = 0, 1, . . . ,n — 1; i = 1, . . . , M; a < x < b. Assume that D" a fi G Loo (a, b) , 
i = 1, . . . , M; each of which is of fixed sign a.e. on [a, b] . Consider also p(t) > 
and q (t) > 0, with all p(t) , Jjy, g (£) , ^y G Loo (a, &) • Lei A Q > A a+ i > 1. 

ifere #3 is as in (73) , L(x) as in (50) , P\ (x) as in (51) , T (x) as in (52) , 
u>\ as in (53) , and $ as m (54) . 

x ( (m-i 

j a «M j J E [l(^/,-)MI Ao |^ +1 /i + iM| +1 

+ |(r>£/ j+1 ) HI A ° |DZi +1 / J - M| Aa+1 ] } 



|(d^/i)M| a °|^ +1 /mM| 



+ |(^/ M )H| A -|^+ViH| Aa+1 ]} 



rfw > 



/ A Q+ A Q+1 X / A a +A a + 1 X 

M I » M " J$(x) 



^pHf^ipr^OHrj^ 



all a < x < b. 



Proof. As Theorem 13 of [7] and Theorem 13 of [8]. □ 

We have the special cases. 

Corollary 57. (to Theorem 54, A/3 = 0, p (t) = q (t) = 1). 

Then 



J* (f^\D? a fA»)\^\DMj(u)\ X Adu> 



> S^if! (x) 



M 






da; 



(^) 



(104) 



(105) 



a// a < x < 6. 

In (105) , (A (x) |a,3=o) of ipx (x) is given by (59) . 

Corollary 58. (to Theorem 55, A Q = 0, p(t) = q(t) = 1). It holds 




M-l 



E [i(^/,-+i)Mi A/, i(^r«/i)Mi ; 



3=1 
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+ \(D2lfj)(uj)\^\(DU J+ i)^)\ K ]} 

+ [i(^/ M )Hi^i(^r a /i)Mi v 

\{DZlh){u)\ X e\{D:j M ){ut»]}d^> 

M v p ^ ) 2^ p ^2(2;)- 



M 



3=1 



dw 



all a < x < b. 

In (106) , (A (x) \\ a =o) of 1P2 (x) is given by (64) . 
Next we apply above results on the spherical shell. 



4 Background-II 



(106) 



Here initially we follow [24], pp. 149-150 and [25], pp. 87-88. Let us de- 
note by dx = A r n (dx) , N £ N, the Lebesgue measure on R N , and 5 ,JV ~ 1 :— 



norm in 



{x e 



\x\ = 1} the unit sphere 
N . Also denote the ball 



P-V 



where |-| stands for the Euclidean 



B(0,R) := {ieR" 
and the spherical shell 



\x\ <R}CR N , R>0, 



A:= B (0, R 2 ) - B (0, R x ), < R x < R 2 . 

For x G R N — {0} we can write uniquely x = ru), where r = \x\ > 0, and 
lo = ^ e 5 ,Ar - 1 , |w| = 1. Clearly here 



and the map 



$ 



l N - {0} = (0, 00) xS N -\ 
l N - {0} -» S"- 1 :*(&) = 



x 

R 



is continuous. 

Also A = [i?i,i?2] x S N_1 . Let us denote by du> = Agw-i (w) the surface 
measure on S 1 ^^ 1 to be defined as the image under <I> of N- Arn restricted to 
the Borel class of B (0, 1) — {0} . More precisely the last definition has as follows: 
let A C S N ^ be a Borel set, and let 
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A := {ru : < r < 1, u E A} C R N , 

we define 

\ s n-i (A) = N ■ A r n (a\ . 

Bx here stands for the Borcl class on space X. 
We denote by 

Wjv = XsN -' ( s ) = 7^ d " = rp) 

the surface area of S^" 1 and we get the volume 

0J N r N 2ir N / 2 r N 



\B(0,r)\ 



N NT (N/2) ' 

so that 

2ir N / 2 
|B(0,1)| -" 



AT (AT/2) ' 
Clearly here 



Vol (A) = \A\ 



lo n (R? - R») 2ir N / 2 (R? - R") 



N NT (N/2) 

Next, define 

$ : (0,oo) x S"- 1 ->R N - {0} 

by ip (r, u)) :— rm, tp is one to one and onto function, thus 

(r,oo) = ^- x (x) = (|a|,*(aO) 

are called the polar coordinates of x € R N — {0} . 

Finally, define the measure Rjy on ((0,oo) , <B(o,oo)) by 



Rn (T) = j r N - l dr, any T € B {0iOo) . 



We mention the very important theorem 

Theorem 59. (see exercise 6, pp. 149-150 in [24] and Theorem 5.2.2 pp. 

87-88 of [25]) We have that A r n = (R N x X s n-i) o tp" 1 on S R w„{ }. 

In particular, if f is a non-negative Borel measurable function on (R^, B r n) , 
then the Lebesgue integral 

f(x)dx= f r"- 1 ( ( f (ru) X S m-i (du) J dr 

J(o,oo) KJs"- 1 J 

f (ru>) r N - x dr ) A s «-i (du) . (107) 

S"- 1 W(0,oo) / 
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Clearly (107) is true for f a Borel integrable function taking values in M. 
Based on Theorem 59 in [5] we proved the next result which is the main tool 
of this section. 

Proposition 60. Let 

/ : A - R, 
be a Lcbesguc integrable function, where 



A := B (0, R 2 ) - B (0, R x ), < Rx < R 2 . (108) 

Then 

f(x)dx=[ [ / / (ra) ^^dr ] dw. 

We make 

Remark 61. Let F : A = [i?i,i? 2 ] x S 1 ^ 1 -► R and for each w G S^ -1 
define 

ff„(r):=F(r«) = F(a:) ) 



where x G A, with A := B(0,i? 2 ) - B(0,i?i); < i?i < r < i? 2 , r = |x| , 

For each ui G S 1 ^^ 1 we assume that g^ G AC" ([i?i, i? 2 ]) , where n := \v\ , 
^>0. 

Thus, by Corollary 8, for almost all r G [i?i,7? 2 ] , there exists the Caputo 
fractional derivative 

DZ Rl 9 u W = w 1 , f (r - i)"" 1 - 1 .e } (*) (ft, (109) 

i (n-v) j Rl 

for all wG S^" 1 . 

Now we are ready to give 

Definition 62. Let F : A -> R, v > 0, n := \v\ such that F (-w) G 
AC™ ([i?i, i? 2 ]) , for all w G S*^ 1 . 

We call the Caputo radial fractional derivative the following function 

^(*)._ i r (r _ ir -^M d , (110) 



dr" r (n — v) J R dr n 

where x G A, i.e. x = rw, r G [Ri,R 2 ] , cj G S N_1 . 
Clearly 



<9r° 



F(x), 



Above function (110) exists almost every where for x G A. 
We justify this next. 
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Note 63. Call 

r d v F (x) ~] 

A i: = re [Ri,R 2 ] ■ ' v does not exist \ . (Ill) 

We have that Lebesgue measure Ar (Ai) = 0. 

Call An := Ai x S 1 ^^ 1 . So there exists a Borel set A£ c [i?i, R 2 ] , such that 
AiCA;, A r (AJ) = Ar (Ai) = 0, thus_Rjv (AJ) = 0. 

Consider now A^ := A^ x S 1 *' 1 C A, which is a Borel set of IR^-jO} . Clearly 
then by Theorem 59, Arjv (A^) = 0, but A^v C A*^, implying Arn (An) = 0. 

Consequently (110) exists a.c. in a; with respect to Arn on A. 

We give the following fundamental representation result. 

Theorem 64. Let v > 7 + 1, 7 > 0, n := \v\ , F : A -> M wii/i Fell (A) . 

isswme tftat F(-u>) G AC n ([R 1 ,R 2 \) for all uj G S N -\ and that 9 ' R ^fJ"" ) € 
L oa (R 1 ,R 2 ) for allueS N - 1 . 

d U Tf Fix) 

Further assume that "" i g — € L^ (yl) . More precisely, for these r G 
[i?i,i?2] i for each u) G S N ~ , for which D" R F (ru>) takes real values, there 
exists M 1 > such that \D% Ri F (rw)\ < M 1 . 

We suppose that — i,/ = 0, j = 0, 1, . . . , n — 1, for every ui G S . 



Then 

r) 7 „ F(x) 

— nT 



d2 Rl F(x) 



<^-^ = DJ R F (rw) 



^T^y £ (r - t)^ 7 " 1 (D: Rl F) (tu) dt, (112) 

tr«e \/x G A, i.e. tr«e Vr G [J?i,i?2] «^^ Vw G S N_1 , 7 > 0. 
-Here 

D^ i F(. w )e^C([i? 1 , J R 2 ]), (113) 

VojeS N -\j>0. 

Furthermore 

92 Rl F(x) 



., . G Loo (A) , 7 > 0. (114) 



in particular, it holds 



F(x) = F (rcu) = J- T ( r - if" 1 (l>: Bl F) (tw) dt, (115) 

true Vx G A, i.e. true Vr G [R\,R2\ and \/ui G S N ~ 1 , and 

F(-w) G AC([i?i,fl 2 ]), VweS"" 1 . (116) 

Proof. By our assumptions and Theorem 16, Corollary 14, we have valid 
(112) and (115) . Also (113) is clear, see [5]. Property (116) is easy to prove. 
Fixing r G [Ri, R2] , the function 

6 r (t,w):=(r-t) v -' r - 1 D: Rl F(tu) 
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is measurable on 



[i? 1 ,r]x5 JV - 1 ,B [JJl , r] xB 



S N-1 



Here B\R ltr ] x B s n-i stands for the complete cr— algebra generated by <BrR 1)T .] x 
BgN-i, where Bx stands for the completion of fix- 
Then we get that 



\6 r (t, u>)\ dt ) duj 



fli 



d: Rl F(x) 



dr v 



(r - tY^- 1 \D v , Ri F{tu) | dt) dto < 
■i / 

(r-t)"~ 7-1 <ft| ,L; 






fli 



(117) 
(118) 



2n N / 2 \ (r - fix)"' 7 

!Ar(iv/2) J (1/-7) - 



d* Rl F(x) 



dr v 



oo.dfii.rjxS"- 1 ) 

277^/2 ^ (R 2 - i?r)^ 7 
oo.A 



T(N/2)J (i/- 7 ) 



< oo. 



(119) 



Hence <5 r (t, w) is integrable on 



xfi c 



Consequently, by Fubini's theorem and (112) , we obtain that Dl R F (rui) , 
i/>7+1, 7>0is integrable in o> over (S^^Bgjv-i) . So we have that 
Dj R F (rui) is continuous in r e [i?i,i?2], Vw e S N_1 , and measurable in 
to e 5 W_1 , Vr e [i?i,_R 2 ] • So, it is a Caratheodory function. Here [i?i,i?2] is a 
separable metric space and S N ~ l is a measurable space, and the function takes 
values in W := RU {±00} , which is a metric space. Therefore by Theorem 
20.15, p. 156 of [1], (D2 R F) (ruo) is jointly (B[ RlyRj2 ] x B s n-i) — measurable 
on [i?i,i?2] x S' Ar_1 = A, that is Lebesgue measurable on A. Indeed then we 
have that 

\D? R F(ruj)\< ' 



< 



iRi 

\W Rl F{-u)\ 



T[y-i) 

{r-tY-<- l \Dl Ri F{tu)\dt 



oo,[R lt R 2 ] 



r(i/- 7 ) 

M x (r-R x ) v - 



;y— 7— 1 



< 



Mi 



r(z/-7) (v-7) r(i/-7-i) 

for all w e S^ 1 and for all r e [i?i, i? 2 ] • 



(r-t) 



(i?2 — #1 



dt < 



(120) 
(121) 



\ v — 1 .. 



T < OO, 
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I.e. we proved that 

\Dl Ri F(ru>)\ <r<co,Vwe S N ~ 1 ,md^r€ [Ri,R 2 ] 



(122) 



D 



Q~l F{x) 

Hence proving * R Q r -, — G L^ {A) , 7 > 0. We have completed our proof. 



5 Main results on a spherical shell 
5.1 Results involving one function 



We give 

Theorem 65. Let v > 7$ + 1, 7^ > 0, i = 1, . . . ,1 € N, n := \v~\ , and 
< 71 < 72 < 73 < . . • < 7;. Here f : A — ► R is as m Theorem 64- Let Ti > : 

Z)i=l r i = P- J/ 71 = we sei r l = 1- ^ ei s l> s 'l > 1 : 57 + F = 1> anrf s 2, 

s' 2 > 1 : — + 4- = 1, and p > s 2 , TV > 2. Denote 



and 



Also call 



The 



Qi{R 2 



O2 (i? 2 ) := 



' pCJv-iX+i (jv-i) s ' 1+ i \ V«i 
■n-2 - K l \ 



R. 



(JV-l)«i + l 



(l-JV)^ + l (l_JV)^+l 



P/ s 2 



(1-N)f + 1 



p- s 2 
a := . 

PS 2 

C:=Q 1 (R 2 )Q 2 (R 2 ) 
l = \ I (T (v - rip (v - v - 1 + <r) r * 

(R 2 - fll) (£U(^-D^^-i) 
EL^^-i^iUs^-iUi^' 



n 



n 



c 



a.v/(*) p 



cfcc < 



dx. 



(123) 



(124) 



(125) 



(126) 



(127) 
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Proof. Clearly here / (-ui) fulfills all the assumptions of Theorem 24, \/oj <G 

s N -\ 

We set there qi (r) = q 2 (r) := r^ 1 , r € [i?i , i? 2 ] • 
Hence by (30) we have 



R 2 l 



'U 



SVM 



C I r 
'Hi 



N-l 



dX R J{rw) p 



dr < 



dr v 



dryueS 



N-l 



(128) 



Therefore it holds 



ii. 



i 



„N- 



'W 



Ri 



C I \ I ' 

IS™- 1 \JR 



N-l 



d: Rl f(ru) p 



dr du> < 



Q r v 



dr du>. 



(129) 



Using conclusion of Theorem 64 and Proposition 60 we derive (127) . □ 

We continue with the following extreme case. 

Theorem 66. Let v > 7$ + 1, 7, > 0, i = 1, . . . ,1 € N, n := [V| , and 
< 7i < 72 5: 73 < • • • < 7z- .ffere / : A -» K as in Theorem 64- Let r^ > : 
Z)»=i r i — r - -V 7i = we set n = 1, N > 2. 

Call 



M := ^ ^-^) I : „. 

nil (r {v - 1i + l)) n (rv Eli r i7 i + 1 , 



(130) 



Then 



n 



svw 



MM[ 



2n N / 2 



dx < 



r (AT/2) ' 



(131) 



Proof. Clearly here f (-u>) fulfills all the assumptions of Theorem 26, Vu e 

s N -\ 

We set q(r) = r™" 1 , r <= [i?i,i? 2 ] • 
Hence by (35) we have 



r R 2 l 

/ r N -^\{\D^ R J(r^dr< 
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R%- 1 {R 2 -R 1 ) ru -^=^" H+l 

ni=i(r(i/-7i + i)) ri f"/-EU»-i7i + i; 



d* R J(-u) 



di 



Hence 



fii 



<MM[, VweS^ -1 . 

l i\\ D TRj(™)\ ri dr\dLU< 



R 2 l 



MM{ 



2tT N ^ 



oo,[fli,fl 2 ] 

(132) 



(133) 



r (n/2) ' 

Using the conclusion of Theorem 64 and Proposition 60 we derive (131) . □ 



5.2 Results involving two function 



We give 

We need to make 

Assumption 67. Let f > 7, + 1, 7» > 0, i = 1, 2, n := \v\ , f\, / 2 : A — ► R 

with /i, / 2 G Li (A) , where A := B (0, i2 2 ) - 5(0, fli), < i?i < i? 2 . Assume 
that /i(-w), / 2 (-w) G AC n ([i?i,i? 2 ]) for all w e S^" 1 , and that d " R y fi( '~'\ 



dr u 



e Loo (Ri, R2) , for all to e S 



N-l. 



1,2. Further assume that 



VjH 



dr v 



Loo (A) , i — 1,2. More precisely, for these r € [i?i, i? 2 ] , Vw e 5^ *, for which 
D" R fi (rev) takes real values, there exists M$ > such that 



\D^ Rl fi(ru>)\ <M t , for i= 1,2. 



(134) 



We suppose that 



ff/i (gig) 
dri 



0,j = 0,l,...,n- 1, 



VweS*- 1 ; i = 1,2. 

Let A y > 0, and A a , A^ > 0, such that A„ < p, where p > 1. If 71 = we set 
A Q = 1 and if 7 2 = we set A/3 = 1, here N >2. 

Assumption 67 . (continuation of Assumption 67) 

Set 

/■in 

(w-tt-^- 1)pl( v- l) t^)dt, 



Pk(w) 
k= 1,2, R x < w< R 2 , 



A , s w ( Ar - 1 )( 1 -^)(p 1 H) A °( E ^)(p 2 H) A ^^) 

A(w) := j j 

(I>- 7l )) A "(T>- 72 ))^ 



(135) 



(136) 
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A (R 2 



(P-A„) 



(A(w)) 



p/(p-K) 



dw 



Ri 



(137) 



We present 

Theorem 68. All here as in Assumption 67, 67 , especially assume X a > 0, 
\fj = and p = X a + X v > 1. Then 



,9 r 7l 



dr v 



dl Rl f2(x) 



dr~»i 



d: Rl h(x) 



Q r u 



A„ 



dx < 



(Ao (R 2 ) U, =0 ) 



9: Rl h(x) 



dr h 



(¥) 



^/jW 



dr 1 - 



cfe. 



(138) 



Proof. We apply here Theorem 30 for every u) G S , here p{r) — q (r) = 
r N_1 , r e [i?i, i? 2 ] • Use of Theorem 64 and Proposition 60. So proof is similar 
to the proof of Theorem 65. □ 

It follows the counterpart of the last theorem. 

Theorem 69. All here as in Assumption 67, 67 , especially suppose X a = 0, 
A/3 >0, p = X v + Xf3 > 1. 

Denote 



Then 



2 WA, -1, ifX >X„, 
1, «/ A/? < X v . 



d? R j2{x) 



Q r l2 



% Rl fi(x) 



%,/iW 



a r 72 



^ 

8^/aW 



dr" 



dx < 



(A (R 2 )\ Xa =o)-2 X e /p [ — 
P 



(if) 



;A„/p 



Q r v 



Q r v 



dx. 



(139) 



(140) 



Proof. Based on Theorem 31, similar to the proof of Theorem 68. □ 
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Theorem 70. All here as in Assumption 67, 67 , especially suppose X p 
X a , Xfj > 0, p = X a + Xp + X v > 1. 
Denote 



Then 



L 



* := 2l 



,9 r 7l 



1, if X a + Xp > X v , 
if X a + Xp < X v . 



x ' 8&M X ) a " 9 ,V/iW 



%,/iW 



3 r 72 



a r 72 



^/ 2 W 



A, (#2 



dr^ 1 
(K/p) r 



Qr^ 



^/ 2 (x) 



r9/ 



<ix < 



(A Q + Xp)p 

d: R Ji(x) 



Q r u 



A A„/ P + 2 (A Q +A,)/ P( ~ iA ^ ) A„/ P 

d: Rl f2(x) p 



Q r v 



dx. 



(141) 



(142) 



Proof. Based on Theorem 32, similar to the proof of Theorem 68. □ 
We give the next special important case 

Theorem 71. All as in Assumption 67 without A„ there. Here 72 = 7i + 1, 
X a > 0, Xp := X a+ i <G (0, 1) , and p= X a + A Q+1 > 1. 
Denote 

2 (A Q /A Q + 1 ) _ j if K > Aa+l 

1, v A a < A a +i, 



and 



Then 



ft* 



L(R 2 ):= 



3 (1-A a+1 ) 

(W-Aa+l) 

nCi-Ac+i) 



^2 



-«1 






P 1 (R 2 ):= / " (i?2-t) (L/ - 7l - 1)p/(p - 1} i(^)dt, 



Hi 



$ { R 2 ):=L{R,)( ^0^1 \v-\ 



d: Rl h{x) 

dr^ 1 






(r(^-7i)f 

<9 r 7l + l 



ff£7i(*) 

9 r 7l + l 



dx 



(143) 



(144) 

(145) 
(146) 
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<$(i? 2 ) / 

J A 



d: Rl h{x) 



Q r v 



WrMx) 



dr v 



dx. 



(147) 



Proof. Based on Theorem 33, similar to the proof of Theorem 68. □ 

We give an L^ result on the shell. 

We need to make 

Assumption 72. Let v > 7; + 1, 7* > 0, i = 1, 2, n := \v\ , /i, / 2 : A — ► M 

with /i, / 2 e Li (A) , where R N D A ■- B (0, R 2 ) - B(0,i?i), < i?i < i? 2 , 
N > 2. Assume that /i (-w) , / 2 (-w) G AC™ ([i?i,i? 2 ]) for all to £ S^" 1 , and 

that 9 * Rl d i r i }' U)) e Loo (-Ri, i? 2 ) , for all w € S^" 1 ; z = 1,2. Further assume that 

d ' R ^t {x) € ioo (A) , i = 1,2. More precisely, for these r e [i?i, i? 2 ] , Vw e S^" 1 , 
for which D^ /, (rw) takes real values, there exists M, > such that 



|^*V/i(M| <-Mi, fori = 1,2. 



(148) 



We suppose that 



ff/i (gig) 
dri 



0,.7 = 0,1, 



,n- 1, 



Vwe5 



JV-l. 



1,2. 



Let A„, A Q , A^ > 0. If 71 = we set X a = 1 and if 7 2 = we set A^ = 1. 

We present 

Theorem 73. All as in Assumption 72. 

Set 

T{R 2 -R 1 ) 



R 



N-l 



(v\ a - 7i A Q + vXp - 7 2 A,3 + 1) 

(R — R )( uXa — 'i A =+ J/A /3-72A,- < + l) 

(i>-7i + i)) AQ (r>-72 + i)) A0 ' 



(149) 



Then 



d: Rl h{x) 



9rTi 



dT Rl h{x) 



a r 72 



d: Rl h(x) 



dr u 



%/iW 



Q r l2 



9: Rl h{x) 



a r 7i 

<T(R 2 - Rj. 



%rM*) 



Q T v 



dx 



7T N / 2 

T (AT/2) 



M 



2{\ a +X, / ) 



M? X " + M. 



2\ fi 



M. 



2{\ a +\„) 



(150) 



r N -\ r e 



Proof. Apply Theorem 38 for every u; e 5 fJV_1 , here p(r) 
[i?i,i? 2 ] . It goes as the proof of Theorem 66. Finally use Theorem 64 and 
Proposition 60. □ 
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5.3 Results involving several function 



We need to make 

Assumption 74. Let v > 7* + l, 7, > 0, i = 1,2, n := \v\ , /_,- : A 



I with 
/j GL 1 (A),j = l,...,M,MeN, where R^D/l^B (0, i? 2 ) - B (0, i?i)", < 
Ri < R 2 , N > 2. Assume that f d (-to) G AC" ([i?i,i? 2 ]) for all lu <G S^ -1 , and 



that 



Or 1 ' 



e ioo (Ri,R 2 ) , for all w e 5 W 1 ; j = 1, . . . , M. Further assume 



that "" R g^ x "' e Loo {A) , j = 1, . . . , M. More precisely, for these r G [i?i, i? 2 ] , 
Vw € S^ -1 , for which D" R fj (rui) takes real values, there exists Mj > such 
that 



\D: R J 3 (roj)\<M v for j = l, 



,M. 



(151) 



We suppose that 



Q r k 



0,/c = 0,l,...,n-l, 



Vw€S N ~ 1 ij = l,...,M. 

Let A„ > 0, and \ a , Xp > 0, such that A„ < p, where p > 1. If 71 = we set 
A Q = 1 and if 72 = we set A/3 = 1 . 

We give 

Theorem 75. Lei fj,j = 1, . . . , M, as in Assumption 74- Let X u > 0, and 
X a > 0; X f j > 0, p := X a + X u > 1. Set 



/■if 

P k (w):= {w-t) {v - rfh - 1) ^t(^)dt, 



fli 



fc= 1,2, i?l < w< i? 2 , 



AH 



/^K^j (fi H) A °(^) (p 2 H)^(^) 
(i>- 7 i)) Aq (i>-7 2 ))^ 



A) (#2 



Tafce the case of Xp = 0. Then 



11, 



{A(w)) p,x " dw 



A Q /p 



Ri 



(152) 

(153) 
(154) 



M 

E 



%,/,(») 



9rTi 



^/;(») 



Q r v 



dx 



<(A,(* 2 )U,=o)(^) 



(¥) 



M 

E 



<9r" 



f/.r 



(155) 
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Proof. As in Theorem 68, based on Theorem 45. □ 
We continue with 

Theorem 76. All basic assumptions as in Theorem 75. Let A„ > 0, A Q = 0; 
A/3 > 0, p := \ v + A^ > 1, P 2 defined by (152) . 
Now it is 

.. . _ w ( JV - 1 )( 1 -^)(P 2 ( w )) A ' 3 ( E f 1 ) 

A(w) : — r , 

(i>- 72 ))^ 



Denote 



Call 



The 



R-2 



-Wp 



\p/*p 



A (R 2 ):=U (A(w)r^dw 



2^/^-1, z/A^>A„ 
1, if A/? < A„. 



^II^^A^R,}-!"' '■(^>j KP S^ /p . 



^ ir ^/i + iW X > d: Ri fj(x) A " 



E 



9 r 72 



^ r 72 

A/3 



d"* Rl f j+ i(x) 



dr v 

A 



^ 



<9 r 72 



Q r 72 



2^2 (P 2 ) 






M 

E 



9r^ 



dx < 



dr" 



dx 



(156) 
(157) 

(158) 

(159) 



(160) 



Proof. As in Theorem 68, based on Theorem 46. □ 
We present the general case 

Theorem 77. All basic assumptions as in Theorem 75. Here \ v , X a , Xp > 0, 
p := X a + A/3 + A,, > 1, Pfc as in (152) , A as in (153) . Here 



A (i? 2 ) 



/?2 



(AW) 



p/(Xc,+Xp) 



dw 



fli 



7i 



-J 2^ 



1, «/ A Q + A/3 > Ay, 
i/ A Q + A/3 < A y . 



(161) 
(162) 
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Put 



Then 



<p 3 (R 2 ) := A (P 2 ) 



K 



{X a + \f})p 



(K/p) 






M-l 

E 



d? R Jj{x) 



a r 7i 



%,/ J+ iW 



^ r 72 



^/i(x) 



Sr 1 - 



%,/i+iW 



Q r 7l 






%,/iW Aq ejMW A ^ ^v/i(x) A - 



Q r l2 

2^3 (ifc) 



<9rT2 



^ R Jm{x) 



d r 7l 
M 



^ r i/ 



dr v 



dx < 



E 






eta: 



(163) 



(164) 



Proof. As in Theorem 68, based on Theorem 47. □ 

We show the special important case next. 

Theorem 78. Let all as in Assumption 74 without X u there. Here 72 = 
71 + 1, and let X a > 0, Xp := A a +i, < A Q+ i < 1, such that p := X a + X a+ i > 1. 
Denote 

2 (A Q /A Q + 1 ) _ 1; if Xa > Aq+1j 

1) if X a < A a +i, 



and 



L{R 2 



T> 1-^c + l 

H 2 



K 



O (1 - Ag + l) 

(iV-Aa+i) 

(1-Aa+l) 



A Q +i 



?3 A Q + 



^a + A Q + i 



.JJ 2 

P (i? 2 ) := / (i? 2 - t f-^- 1} (^) t(^r)dt, 



$ (P 2 ) := L (P 2 



' fiQg2)____j 

(i>- 7 i)) p 



2 (p- 



(165) 



(166) 

(167) 
(168) 
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The 




dXfjix) 



<9rTi 



dTnTfj+i (*) 



Qrii+i 



%,/i+iW 



9 r 7l 



%r/i w 



%,/iw 



d r 7i 



9 r 7l + l 



%T7mW 



9 r 7l + l 




cfcc < 



(169) 



Proof. As in Theorem 68, based on Theorem 48. □ 

We study the L^ case next. 

We need to make 

Assumption 79. Let v > 7, + l, 7, > 0, i = 1,2, n := [V] , /j : A 



1 with 

/j e ii (A) , j = 1, . . . , M, M e N, where R" D A := B (0, R 2 ) - B (0, R{), < 
i?i < R 2 , N > 2. Assume that fj (-w) g AC™ ([i?i,i? 2 ]) for all w £ S^" 1 , and 



that 
that 



^ Rl /i(-") 



dr v 



K Rl fj (a) 



€ £oo {R11R2) , for a U ^ <= S 1 ; j = 1, . . . , M. Further assume 
G Loo (A) ,,7 = 1,..., M. More precisely, for these r E [R\, i? 2 ] , 



Vw <G S 1 1 , for which D" R fj (rui) takes real values, there exists Mj > such 
that 

I D: Ri fj (rw) I < Mj , for j = 1, . . . , M. (170) 

We suppose that 



d k fj (Rnv) 
dr k 



0,fc = 0,l,...,n- 1, 



VweS N - 1 ;j = l,...,M. 

Let A„, A Q , A/3 > 0. If 71 = we set A Q = 1 and if 72 = we set A/3 = 1. 
The last main result follows. 
Theorem 80. All as in Assumption 79. 
Set 



T(R 2 ):= 



R 



N-l 



(v\ a - 7!A Q + z/A/3 - 7 2 A /3 + 1) 
(L(^-7i + l)) A "(L(z,- 72 + l)) A ^ 



(171) 
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The 




W "' r d^fiix) A " d? R J j+1 (x) A " d» R Jj(x) A 



9rTi 



9 r 72 



Q r u 



d?nJAx) 



Q r ~12 



%,/i+iW 



<9rTi 



^V/.+i (*) 



<9r^ 



%,/iW 



dr-71 



^\/m(x) 



Q r 72 



^VA(x) 



Q r l2 

2n N / 2 
T (AT/2) 



^/mW 



dr^ 



M-l 



KrJm(x) 



dr v 



dx < 



T(R 2 )l Y,{ M f 



A Q +A„ 



m; 



j=i 



(172) 



Proof. Based on Theorem 52; here p(r) 



„iV-l 



G [i?i,i? 2 ], apply (97) 



Vw € S^ 1 . It goes as the proof of Theorem 66. Finally use Theorem 64 and 
Proposition 60. □ 



6 Applications 



We need 

Corollary 81. (to Theorem 68, /i = f 2 ) All as in Theorem 68. It holds 



%,/iW 



07-71 



(A (A 



2 J Afl=0 



A Q 


dr v 


dx < 


\(^)/y 


VrM*) 


J 


KJ A 




dr v 



dx 



(173) 



So setting X a = X u = 1, p = 2 m (173) , we obtain in detail 
Proposition 82. Let v > 7 + 1, 7 > 0, n := \v\ , f : A -> M with / G 
Li (A) , where A := B (0, R 2 ) - B(Q,R{) C R N , N > 2, < R x < R 2 , . Assume 

that f(-Lu) G AC n ([Ri,R 2 ]) , Vw G S^- 1 , and that ^j/j"" G ioo (i?i,i? 2 ) , 

Vcj G S^ -1 . Further assume that * g 1 ^ — G Loo (-4) ■ More precisely, for these 
r G [i?i,i? 2 ] , Vw G S^' 1 , for which b» R J (ru) takes real values, 3 M x > 
such that 

|^ fll /(rw)|<Mi. 
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Suppose that 



ay (gig) 

dri 



= 0,j = 0,l,...,n-l, VweS 



AT-l 



Set 



P (r) := / (r - i) 2 ^" 1 ) jC 1 -^, ^ < r < R 2l 
'Hi 



A(r) 
A (R 2 ) := 



r ( i V) v /p( r ) 
i>-7) ' 



1/2 



(A (r)) 2 dr 



(174) 
(175) 
(176) 



Then 



5rT 






da; < 



^w/.(%^y 



rfx 



(177) 



When 7 = we get in detail 

Proposition 83. Le t v > 1, n := |V] , / : 3 -> M with / G ii (A) , 
where A := B(0,R 2 ) - 5(0, i?i) C l", AT > 2, < i?i < i? 2 , • Assume 

that /(■««;) G AC" ([i?i,i? 2 ]) , Vw G S^" 1 , and that ^j/j'" G Loo (i?i,# 2 ) , 



Vcj G 5 . Further assume that 



gggi/te) 



Q r u 



G ioo (A) . More precisely, for these 
r G [-Ri,i? 2 ] , Vw G S^ -1 , for whichl);^ / (rw) takes real values, 3 Mi > 
such that 

I^v/mI^Ml 

Suppose that 



dri 



Set 



Po (r) := / (r - i) 2 ^ 1 ' i^^di, R^rK R 2 , 



A (r) := 



A^^Mrj 



r» 



i?. 2 



1/2 



4>(fl 2 ):= ( / (A(r)) z dr 

'-Rl y 



(178) 
(179) 
(180) 



Then 



l/(*)l 






dx < 
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2 °(^) 2 - 1/2 (/,(%^) **)■ ( 181 ) 

Based on Corollary 35 we give 

Proposition 84. Let v > 7 + 1, 7 > 0, n := \v\ , f : A — > R with / <G 
Li (A) , where A := B (0, R 2 ) - 5(0, i?i) C R^, N > 2, < R x < R 2 . Assume 

that f(-Lo) e AC n {[Ri,R 2 ]) , Vw e S N ~\ and that ^ff^ e Ax> (ili.ik) , 

VweS*- 1 . 
Suppose that 

d j f{R!U>) . N _j_ 

— — 3 = 0, j =0,l,...,n-l, Vwe 5 



Then 



1) / |i>: fll /M||JW(M|*< 

./Ki 



(r - i?!) 



(;/- 7 ) 



{D^ R J {tuj)) 2 dt\ , all £1 < r < R 2 , Vw e S^" 1 . (182) 



2) When 7 = we get 

\f{tu)\\D: R j{tu)\dt 



< 



Hi 



(2r<^ro)(/>- /M)2 4 (i83) 



alii?! <r<i? 2 , VweS^- 1 . 
In particular we have 



/•-R2 
3) / |/(rw)| |Dr fll / M| dr< 

(^^^LJ) ( r (D: R J(ru;)) 2 dr) , Vw e S*" 1 . (184) 

^t^^/jV^^t; \J Rl y * RlJy ' J 

Next we apply Proposition 84, see (183) for proving uniqueness of solution 
in a PDE initial value problem on A. 

Theorem 85. Let v > 1, v (£ N, n := [v] , f : A -> R with f e ii (A) , 
w/iere A := B(0,R 2 ) - B(0,i?i) C 1 N , N > 2, < R x < R 2 , . Assume 

that f(-u>) E AC n ([R!,R 2 }),yuje S N ~\ and that a '% J J"" ) e AC([i?i, i? 2 ]) , 

Vcj G S N_1 . Further assume * R Q rV — € £00 (-4) , suc/i i/iai i/iere exists M\ > 
tuitft |-D^/M| < M i> Vr e [-Ri,-R 2 ] , Vw e S 1 *' 1 . Suppose that 



cM 



= 0,j = 0, l,...,n-l, Vwe S 1 ' 
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Consider the PDE 

1 * RlJ y ' ] - * (x) f (x) , (185) 



dr \ dr u 

\/x € A, where ^ 6 : A — > R zs continuous. If (185) /ias a solution then it is 
unique. 



Proof. We rewrite (185) as 

d (d: R J(rw) 



dr V dr" 



'M/(r«), (186) 



valid Vr e [Ri,R 2 ] , Vw e S 1 *' 1 and ^ 6» : ([Ri,R 2 ] x S^" 1 ) -► R is continu- 
ous. 

Assume /i and /2 are solution to (185) , then 

d~r \ 8r~- ) = (W) h (rw) ' ( 8?) 

and 

d (&i R f 2 (ru>)\ 
Wr ( g r : > )=e{ru:)h{ru), (188) 

Vre [i?i,i? 2 ], VweS^" 1 . 

Call g := /i — f 2 , thus by subtraction in (187) we get 

d (din g(ru>)\ 

I * R ' y{ s - fl MsM> (189) 



JV-l 



9r \ Sr^ 
Vr e [i?i, i? 2 ] , Vw e S^ -1 . Of course 
tfg (R x uj) 



, hj -0,j = 0,l n-l,Vwe5 

Consequently we have 



Sr^ ) dr \ dr u 

»H ff H(%^], (190) 



Vre [i?i,i? 2 ], VweS"- 1 . 

Hence 

'^,JN\ 9 (dX Ri g{tuj) 



Rl \ dr v J dr \ dr v 



(II 
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Vre [R 1 ,R 2 ],\/uGS 
Therefore we find 



■N-l 



( 



>Ri 



(&i R g(tu>)\ 
'NgN * Rl dr l - )dt, 



Ri 



zN-l 



Notice that ^"ff 1 " =0,Vwe S^- 1 , see (iiO) . 



Consequently we find 






= 2 



iRi 



'NsN — ^ — 



I rfi 



<2||ff|| 



/Hi 



la (Ml 



ff fll g(M 

dr 1 ' 



fit 



(^ f l|g|loo(r-iiir ' 



IR 



(D: Ri g(tuj)) 2 dt\< 



AR*-Ri) 



Vr e [Ri,R 2 ], Vwe5 
Call 



JV-1 



, (g2 - glT 



A: := '' """ - — , -^- 

r (1/) ^^217^1 

So we have proved that 



>0. 



(192) 



(193) 
(194) 



■)(f R ( D *Rr9(^)) 2 dty (195) 



(196) 



^t^ ) <*(j£(*WM) 2 *), (197) 



Q r u 

Vr G [Ri,R 2 ] , Vw G S N -\ Here ^^(-w) G C([i?i, R 2 ]) , Vw G S"" 1 . 

Hence by Gronwall's inequality we get (D" R g (rw)) = 0, so that D" R g (rut) 

0,Vr G [iZi.iJa], Vw G S N ~K Thus £ (^^) = 0,Vr G [i?i , Jfc] , Vw G 

S^" 1 . And by (188) we have 6 (no) g (no) = 0,Vr G [i?i,i? 2 ], Vw G S*" 
implying g (no) = 0, Vr G [i?i, i? 2 ] , Vw G S^" 1 . 

Hence proving /1 (no) = / 2 (rw) , Vr G [i?i, i? 2 ] , Vw G S^ -1 . Thus 

/i(a;) = /2(x),VxG3 ) 



r-iV-1 



hence proving the claim. □ 
We give the very important 
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Remark 86. From Corollary 12 we saw that: for v > 0, n := \v\ , f £ 
AC m ([a,b}), given that D v a f (x) exists in R, Vx g [a, b] , and / (fe) (a) = 0, 
fc = 0, 1, . . . , n— 1, imply that D" a f — D v a j. Also we saw in Theorem 16, 17, that 
by adding to the assumptions of Theorem 16 that " there exists D^f (x) G R, 
V.x G [a,b] ", we can rewrite the conclusions of Theorem 16, that is getting 
the conclusions of Theorem 17, in the language of Riemann-Liouville fractional 
derivatives. Notice there, under the above additional assumption that also holds 
D2f(x) = Dl a f(x),Vxe[a,b]. 

Theorem 16 is where is based the whole article. 

So by adding to the assumptions of all of our results here for all functions 
involved that " there exists D v a f (x) G R, Vx G [a, b] " we can rewrite them all 
in terms of Riemann-Liouville fractional derivatives. Accordingly for the case 
of spherical shell we need to add " there exists D^ f (rui) € R, Vr € [i?i, R2] , 
for each u> G S' Ar_1 ", and all can be rewritten in terms of Riemann-Liouville 
radial fractional derivatives. 

So as examples next we present only few of all these can be rewritten results. 

We present 

Theorem 87. Let v > 7+1,7 > 0. Call n := \v~\ and assume f G 
AC m ([a, b}) such that / (fc) (a) = 0, k = 0,1,..., n- 1, and 3D v a f {x) G R, 
V.x G [a, b] with D v a j G L^ (a, b) . Let p, q > 1 such that - + - = 1, a < x < b. 

Then 

' X \D2.f(u;)\\(D:f)(u;)\dcj< 

PV—P7—P+2 

(x — a) p 



(0,)T(u-j) {{pv-py-p+ 1) (j>v - P7 - p + 2)) 



i/„ 



\D';j^)\"du) " . (198) 



Proof. Similar to Theorem 18. □ 

The converse result follows. 

Theorem 88. Let v > 7+1,7 > 0. Call n := \v\ and assume f G 
AC n {[a,b\) such that f {k) {a) = 0, k = 0,1,..., n- 1, and BD»f(x) G R, 
Va; G [a, b] with L) v a j, jj^j G £00 («, b) ■ Suppose that D v a f is of fixed sign a.e in 

[a, b] . Let p, q such that 0<p<l,q<0 and — I — = 1, a < x < b. 
Then 

[ X \DZf(u)\\D:f(uj)\du> 

J a 

pv-p-y-p+2 

(x — a) p 
(</2)T{v- 1 ){{p V -p 1 -p+l){p V -p 1 -p + 2)) 1/p 

■^£\DU^)\ q d^ \ (199) 
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Proof. As in Theorem 20. □ 

We present 

Theorem 89. Let v > 7$ + 1, 7, > 0, i = 1, 2, n := \v\ , and f\, /a € 

AC" ([a,b]) such that f[ j) (a) = / 2 (j) (a) = 0, j = 0,1, . . . ,n - 1, a < X < b. 
Consider also p(t) > and q(t) > 0, with all p(t), -k*, q(t) E L OQ (a,b). 
Further assume 3D^fi (x) G K, V.t € [a, 6] and Da.fi € Loo (a, 6) , i = 1,2. Lei 
A y > and A Q , A^ > suc/i i/iai A^ < p, where p > 1. 

//ere P& is as m (39) , A (u>) is as in (40) , Aq (x) as in (41) , 5i as in (42) . 

If A/3 = 0, we obtain that 



q{u>) 



l^/iMI^Pa/iMI^ 



IDy/2 Ml Aa 1^/2 Ml A - 



(A (3) 



Afl=0j 



A, 



du) < 

A./p 



A Q + A„ 



01 



( A^) 



(200) 



Proof. As in Theorem 30. □ 

Corollary 90. (All as in Theorem 89, \p = 0, p (t) = q (t) = 1, A Q = \ u = 
1, p = 2.) In detail: 

Let 1/ > 71 + 1, 71 > 0, n := M , f u f 2 e AC n ([a, b}) : f[ j) (a) = / 2 (j) (a) = 
0, j = 0, l,...,n - 1, a < x < b; 3D%fi{x) <= R, V.t e [0,6] wrt/i D£/j € 
Loo (a, 6) , i = 1, 2. TTien 



(201) 



[|(l^/i)MII(^/i)MI + 
K^/2) Ml K^/2) Ml] du< 

' (x - a)^-^ * 

2r (v - 7l ) y^r7V2^ -271-1 



((^/l)M) +((^a/2)Mr 



rfw 



all a < i < 6. 

We need 

Definition 91. Let F : A -> R, v > 0, n := \v\ such that F (-u) G 
AC™ ([i?i,i?2]) , for all w e S ,JV_1 . We call the Riemann-Liouville radial frac- 
tional derivative the following function 



dr v 



1 



d" 



r (n — v) dr r 



(r - t) n ~ 1/ - 1 F (M dt, 



(202) 
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where x E A, i.e. x = rui, r € [R\,R 2 ] , W € 5^ x . 
Clearly 

d% F (x) 

and 



dr° 
d Ri F(x) _d"F(c 



Q r v 



Or 



-, if i/eN. 



We give 



vve give 

Proposition 92. Let 1/ > 7 + 1, 7 > 0, n := \v\ , f : A -> K with / G 
Li (A) , where A := B (0, i? 2 ) - B (0, i?i) C R N , N > 2, < i?i < i? 2 . Assume 

that /(-w) G ^^"([i?!,^]), V^ G S^" 1 , and that %^ G L X (R U R 2 ), 
Vcj G S^ -1 . Further assume that 3D R J (ruj) G R, Vr G [i?i,i? 2 ], for each 

w G S^- 1 , with a \ / ? ) G Loo (A) . We suppose Vr G [i?i, R 2 ] and Vw G S^" 1 
that 3Mi > such that \D R J (ru)\ < M x . 
Suppose that 



Set 



also 



d J f(RiUj) . Ar _ 1 
^- =0,j = 0,l,...n-l,Vo;eS 

P (r) := f (r - i) 2 ^" 1 ) ^-^di, ^ < r < R 2 , 



A(r) 
A (R 2 ) := 



r(i/- 7 ) ' 

rfl, \ 1/2 

(A(r)) 2 rfr . 



'-Ri 



(203) 

(204) 
(205) 



Then 



dlj(x) d R J{x) 




(206) 



Proof. Similarly as in Proposition 82. □ 
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Asymptotics for Szego polynomials with respect to 
a class of weakly convergent measures 

Michael Arciero 1 , Lewis Pakula 2 
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2 University of Rhode Island, Kingston RI, 02881, pakula@math.uri.edu 

Abstract 

Recent results of the author characterize limits for Szego polynomials of fixed degree 
k with respect to measures which are weakly convergent to a sum of m < k point masses, 
with the measures formed by convolving the point masses with the Poisson and Fejer 
kernels. Moreover, the limit polynomial is seen to be the same in each case. Here, we 
show that the Poisson kernel can be expressed as a convex combination of Fejer kernels. 
Conjectures are made for a general class of kernels whose Fourier coefficients fi>(£) form 
convex functions of £. 

Keywords: Szego polynomial, orthogonal polynomial, frequency analysis, Poisson ker- 
nel, Fejer kernel. 



1 Introduction 

Given a measure, [i, on the unit circle, the Szego polynomial of degree k with 
respect to ft, which we denote Pk{z,fi), is the polynomial in the complex 
variable z which attains the minimum 

min f \p(e ie )\ 2 d^9) = f |P fc (e^,/i)| 2 d^9), (1) 

P £A fc J-7T J-K 

where Aj, is the set of monic polynomials of degree k. The Szego polynomials 
with respect to a measure /x form an orthogonal sequence, are uniquely de- 
fined if the degree is less than the number of points on which \x is supported, 
and can expressed as a ratio of matrix determinants or generated recursively 
using Levinson's recursion. Szego polynomials have many applications and 
have been studied widely. See [4, 6, 10, 11] for background. Some results 
related to frequency analysis appear in [5, 7, 9, 8]. The motivation for the 
use of Szego polynomials in frequency analysis is loosely based on the ob- 
servation that the spectral measure of a digital signal with strong sinusoidal 
components will be heavily weighted at the frequency locations 6j, and in 
light of (1), one would expect Pk(z,fj,) to have zeros near e t0j . Note that for 
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Table 1: Poisson and Fejer kernels 



kernel 


density 


moments iph{£) 


h 


Poisson: 
Fejer: 


MO) = 
MO) = - 

n 


l_ r 2 




r \i\ 
n 




"sin(n#/2)~ 
_ sin(0/2) _ 


2 


1 
n 



supp(fi) = m < k, Pk(z, jj) is not uniquely defined since any polynomial with 
m zeros at the point mass locations will attain the minimum of zero in (f ). 
In [I, 2] we consider measures formed by convolving the Poisson and 
Fejer kernels, respectively, with the sum of point masses y^L-, (XjSq., where 
8g. is the point mass measure at 0jthe aj are positive. Both are examples of 
approximate identities iph with h — > as either r — ► 1 or n — ► oo, respectively, 
as indicated in Table 1, where x + = max{x, 0}. It is easy to see that for any 
approximate identity iph, we have the weak-star limit 



lim tl) h * ^ "A = J2 a J 5e 



(2) 



On the other hand, for m < k, the associated Szego polynomials of fixed 
degree k do not necessarily converge. That is, /i^ — > Y^T=i a j$9j does n °t 
guarantee existence of the limit lim^o Pk(z, fih) even if /x^ converges strongly. 
(See [1] for an example.) 

The main point of [1] and [2] is that the Pk(z, VVi*X)j=i a j^8 3 ) do converge; 
moreover, the limit is the same for both kernels. We have the following 

Theorem 1.1 Let iph be either the Fejer or the Poisson kernel with the iden- 
tifications in Table 1. Suppose So is the point mass at = 0j with the 0j 
distinct and aj > for j = 1, 2, 3, ..., m. Then 



lim P k (z, i>h*^2 a Aj) = Pk 



s?, v . 



IF 



(3) 
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where v is the absolutely continuous measure with 

dv 



(W EIN^-^i 2 - (4) 

i =1 p¥=j 



We seek to extend Theorem 1.1 to a larger class of kernels. We consider 
kernels that have moments iph{£) which are convex functions of £ for £ > 0, 
or which can be expressed as a convex combination of Fejer kernels, and make 
a conjecture in each case. The motivation for this is that the Poisson and 
Fejer kernels have moments which are convex functions (though those of that 
latter are not strictly so). Moreover, it is possible to expand the moments of 
the Poisson kernel in terms of those of the Fejer. Specifically, we have 

Proposition 1.1 Let (f> n (9) denote the Fejer kernel for n = 1,2,3,..., and 
define a r ^ n = (1 — r ^ 2 nr n ~ x . Then 



y^Qr,n0nffl 



n=l 



Proof: We show this by writing the above as geometric series. Since n (— I 
4> n (£) we can assume £ > and write 



J2 a rjn(£) = (l-r) 2 £fir n - 1 (l 

n=l 
oo 

1 - rf J2 nrH ~ l - I 1 - r ) 2 Yl lrn ~ X - ( 5 ) 



n 

n=l n=l 



n=l+l n=l+l 



Regarding the first sum in (5), we have 

oo 1 I 



n=e+l y ' n=\ 



(1 — r) 2 dr 

v ' n=0 



' £<"' 
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1 d fl-r 



J+l 



(1 — r) 2 dr \ I — r 
r £ (l + £ - ir) 
(1-r) 2 ' 

Regarding the second sum in (5), we have 

oo / oo 

n=£+l 



(6) 



\n=l 


»i=l 


( 1 


1 — r £ 


Vl-r 


1 — r 


f 





< 7 > 

The sum of (6) and (7) is tjz32 , which, with (5), proves the proposition. 
An immediate consequence of Proposition 1.1 is the following. 

Corollary 1.1 Wit/i <p n and a ryH as in Proposition 1.1 the Poisson kernel 
can be expressed 

1-r 2 -A 

^) = \ e i0- r \2 =22 a rM0) 
n=l 



2 Conjectures for a class of densities 

Let / be a function with the following properties: 
1- f(x) > for all x. 

2. /(x) = /(-*). 

3. / is convex and non-increasing for x > 0. 

Such functions satisfy the Polya criterion and thus are the characteristic 
functions of positive measures. (See, e.g., [3], p482.) We will call {a n } and 
if) a sequence and density, respectively, of Polya type, if (f>(n) — a n — f(n) 
for some function / satisfying the Polya criterion. So the Poisson and Fejer 
kernels are densities of Polya type for < h < 1. We conjecture that Theorem 
1.1 holds for all kernels of Polya type. 



96 ARCIERO-PAKULA: SZEGO POLYNOMIALS 



Conjecture 1 Let fh be a family of Polya type functions for a continuous 
or discrete parameter h on < h < 1 with /^(O) = 1 and lim^o fh(%) — 1 

for all x, and suppose iph is the measure with iphi.0-) = fh(fy- Then (3) and 
holds for the kernel iph, with v given in (4)- 

A variant of Conjecture 1 is motivated by the construction of Proposition 
1.1 and the fact that the Fejer kernel would seem to be a "base case" since its 
moments are linear rather than strictly convex. Indeed, one might suspect 
that any convex function of x can be expressed as a convex combination of 
the functions (1 — -) + . 

Let (p n denote the Fejer kernel as in Table 1 and suppose {a n }~ =1 is 
a sequence of non-negative real numbers with Y^=i a n — 1> then ip(6) = 
Y^Li a n4>n{9) is a density of Polya type. Now suppose that An := {aN, n } 
is a sequence of sequences with Y^Li a N,n = 1 for each iV = 1,2,3, ... and 
liniTv^oo a>N,n = for each n. If the latter holds, we write An — >• 0. We define 
sequences A^ for continuous parameter h — > similarly, and write Ah — *• 0. 
In either case, we simply write A — > 0, and conjecture that Theorem 1.1 
holds for ipA- 

Conjecture 2 Suppose A — *• and 



^a{0) = y^ y ah,n<f>n(0)- 



n=l 



Then (1.1) holds for the kernel ipA', that is, 



3=1 3=1 



%- m [z.i') I 11: -e i8j 



Remarks: 

With a rjU given in Corollary 1.1, A — ► and ipA r is the Poisson kernel, 
while the Fejer kernel corresponds to A N = {0, 0, 0, ..., 1, 0,0, ...}, with 1 in 
the JV-th position. 

We see from Table 1 that the moments of the Fejer and Poisson kernels 
agree to first order in h. The Fejer kernel may thus be thought of as a 
base case in this sense as well. The techniques in [1, 2] do not exploit this 
property, however. It is possible Polya-type kernels are contained in a larger 
class which includes those whose moments agree to first order. 
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Abstract 

An unfolding of a polyhedron consists of cutting the boundary of 
it along some of its edges in such a way that one can flatten out the 
remaining set in the plane in a single piece. An unfolding is a net if 
it does not overlap itself. No convex polyhedra has been found which 
does not have a net, though almost all of its unfoldings overlap. In 
the particular case of the Dodecahedron, we show by means of both 
theoretical and computational considerations that every unfolding is 
a net. 



1 Introduction 

There is an extensive study of unfoldable/foldable structures and ap- 
plications are different and all very interesting. Among them, we 
may mention the Japanese art of paper folding and the utilization 
of unfoldable connected structures in aerospace. See for instance 
http://www.patentgenius.com/patent/6920733.html 

Even very regular polyhedra have an overlapping unfolding, al- 
though it is possible also to construct an unfolding of the polyhedron 
which is a net [3]. Furthermore, Schevon [2] shows for some class 
of polyhedra that almost all unfoldings of a polyhedron in the class 
overlaps. 

In the particular case of the Dodecahedron one can ask if it has 
an overlapping unfolding. The total number of unfoldings for the 
Dodecahedron is 43380. We give an answer to this question by means 
of theoretical and computational aids. See also [1] where a purely 
theoretical approach is considered. 
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(A) 



(B) 



Figure 1: (A): The Platonic solid. (B): Graphic representation of the Dodec- 
ahedron. 

This problem is related to Shepard's conjecture which states that 
any convex polyhedron has at least one net. Up to date there has not 
been found a convex polyhedron for which every unfolding overlaps, 
thus reinforcing the conjecture. 

2 Theoretical framework 

The regular dodecahedron is the Platonic solid composed of 20 ver- 
tices, 30 edges and 12 pentagonal faces. See figure 1 (A). 

Definition 2.1. 

(i) An unfolding of a polyhedron consists of cutting the boundary of 
it along some of its edges in such a way that one can flatten out 
the remaining set in the plane in a single piece. 

(ii) An unfolding is a net if it does not overlap itself. 



Definition 2.2. A single chain is a set of faces belonging to an un- 
folding of a polyhedron and such that anyone of its elements shares 
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(A) (B) 



Figure 2: (A): Unfolding of a polyhedron. (B): Single chain of faces determin- 
ing the overlapping. Obtained from the Wolfram Demonstrations Project. 

at least one edge and at most two edges. The number of faces in the 
single chain is called its length. 

Figure 2 (A) shows an unfolding of a polyhedron, which is not a net. 

2 (B) exhibits the single chain determining this overlapping. In figure 

3 we have an unfolding of the Dodecahedron without overlapping, and 
therefore it is a net. 

Our strategy to solve the problem we stated is quite simple and 
direct. From the lemma 2.1 below, to check if every unfolding is a net 
one needs only to look at all single chains of lengths one to twelve. This 
observation reduces the problem not merely because of the number of 
unfoldings (43380) is greater than that of the single chains (~ 5000), 
but to construct and analyze is easier for a single chain than for an 
unfolding. 

Lemma 2.1. Every unfolding of the Dodecahedron with an overlap- 
ping has a self intersecting single chain of pentagons of length less or 
equal twelve. 

Proof. Suppose that U = { /i , . . . , /12 } is an unfolding of the 
dodecahedron having an overlapping. Let fk and fi a pair of faces of 
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Figure 3: Unfolding of the Dodecahedron which is a net. 

IA with non void intersection and not being adjacent in the unfolding 
IA. Since every unfolding is a connected set, there must exist a subset 
{fki- ■ ■ i ft} of IA forming a single chain of pentagons of length less or 
equal to twelve. This proves the lemma. □ 

Theorem 2.1. If every single chain of pentagons of length less or 
equal to twelve does not intersect itself, then every unfolding of the 
Dodecahedron is a net. 

Proof. Since every unfolding that overlaps contains a self inter- 
secting single chain, from the previous lemma the result follows at 
once. □ 

3 Main Theorem 



Now we will prove that every single chain does not intersect itself. In 
order to show this, we use a computational algorithm that calculates 
and analyzes every single chain of the Dodecahedron. 

Theorem 3.1. Every unfolding of the Dodecahedron is a net. 
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Proof: From theorem 2.1 we need only to check that every single 
chain does not intersect itself. 

The strategy to do it is conceptually simple and is described in the 
next three steps. 

(i) First step: Let 1 < k < 12 and {/i,. ••,/&} denote a single 
chain of the Dodecahedron. We note that for k = 1, . . . , 5 one can 
check just by inspection that no single chain intersects itself. 

(ii) Second step: We proceed iteratively. To check if there is a 
single chain of length 6 that intersects itself, we only have to verify 
that the pentagons at the extremes in the single chain do not inter- 
sect. Precisely, suppose that there is a single chain of length 6 that 
intersects itself. If /$ and fj are not adjacent pentagons of the single 
chain having non void intersection, then there is a subset of pentagons 

{hi, hi, ■ ■ ■ , fe k } ( 2 < k < 6; hi = fu h k = fj) of tne original single 
chain that forms itself another single chain, which we denote by S'. 
This single chain intersects itself. From the fact that no single chain 
of length less or equal to 5 has an overlapping, S' must have length 6. 
Therefore, it is the original single chain and the pentagons overlapping 
are the extremes of the original one. This reduces the problem to check 
wether the pentagons at the extremes in every single chain of length 
6 do not overlap. As a consequence, if we show that the extremes of 
a single chain of length 6 do not intersect, then we would have proved 
that no such single chains have an overlapping. This argument can be 
applied inductively for k = 7, . . . , 12. 

(Hi) Third step: Construction and analysis of single chains. Our 
algorithm calculates systematically each single chain of lengths 6 < 
k < 12 and verifies that there is no overlapping. Therefore, the theo- 
rem is proved □ 

3.1 Description of the algorithm 

We describe the algorithm. The program uses the parameter £=length 
of a single chain. The single chain is represented by a list S of I 
numbers between 1 to 12. By a previous routine we calculate all single 
chains of lengths £ = 6, . . . , 12. The parameter D is a data matrix 
containing the structure of the Dodecahedron, as given in Table 1. 
The algorithm gives as output at a stage a list L of numbers that 
the computer associates to vectors connecting the centers of adjacent 
pentagons. See figures 4 and 9. Here we mean by the center of the 
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Figure 4: Directions 1', 2', 3', 4' and 5' are associated with the vectors 
pointing to the center of adjacent pentagons. 



pentagon, the center of the circle circumscribing the pentagon. This 
fixes the coordinates of the center of each pentagon in the single chain. 
By another routine, we draw the graphic of the single chain S and 
determine if it intersects itself. 
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Table 1. Symbolic representation of the Dodecahedron. 
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Figure 5: Figure (a) shows the orientation chosen for a single chain. Figure 
(b) shows the orientations given for a face "up" and a face "down" , respec- 
tively. 

Note that on symmetry reasons one can assume that all single 
chains start at the pentagon labelled 1. See figure 1. Furthermore, one 
can see by similar arguments that it is enough to analyze single chains 
having faces 1,5,4,. . . and 1,5,10,. . .. The algorithm calculates in a 
previous routine all these single chains. 

We first show how the algorithm works with an example. Suppose 
that we want to analyze the single chain {1, 5,4, 3, 8, 12, 7}. 

Certainly we can suppose that pentagon 1 is placed so that its 
edge d is horizontal. See figure 1. Then pentagon 5 has its edge d also 
horizontal and pentagons 1 and 5 look in an unfolding as in figure 
5(a). It follows that 4 must be attached to 5 at the common edge 
h. See table 1 and figure 1. By simple geometry one realizes that 
pentagon 4 must be attached to 5 in such a way that 4 can be seen 
as a translation (not a rotation) of pentagon 1. Similarly, pentagon 3 
must be attached to 4 and this can be done by translation of pentagon 
5. In a similar way, 8 is attached to 3 by a translation of 1 and 12 is 
attached to 8 by a translation of 5. By simple geometric arguments, 
this continues to hold every time that one attaches a new pentagon at 
an unfolding. 

We assign to every pentagon an interior orientation depending on 
wether it is placed "up" or "down" as shown in figure 5(b). 

Now we add pentagon 4 to 5. See figure 5(a). 5 has already the 
orientation given by the condition that is placed "down" and its edge 
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(b) 



Figure 6: Orientation of face 5. 

d is placed horizontally. From table 1, fifth row, the first five data cor- 
respond to the edges of pentagon 5. We know that edge d has already 
been assigned number 5' which yields the interior counterclockwise 
orientation 

(d,5>), (MO, MO, (r,20, (*,10. 

See figures 1 and 6. 

We proceed to give an orientation to pentagon 4. This pentagon 
is placed "up" and the common edge h with pentagon 5 has already 
been assigned 4'. Looking at table 1, fourth column, the orientation 
obtained for pentagon 4 is 

( 9 ,2'), (MO, MO, (MO, MO- 

See figures 1 and 7. 

Now we add pentagon 3, which must be placed "down" and the 
common edge with pentagon 4 is g. This edge has previously been 
assigned number 2'. Looking at table 1, third column, the orientation 
for pentagon 3 is 

(rMO, (n,30, (5,2'), (MO, (/,50 
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Figure 7: Orientation of face 4. 
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Figure 8: Orientation of face 3. 
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(A) 




Figure 9: (A) The algorithm gives the list L associated to the directions 
connecting the centers of adjacent pentagons. (B) Graphic of the single 
chain. 



See figures 1 and 8. 

Initially, the center of pentagon 1 is settled so that it coincides with 
the origin of the chosen system of coordinates. If one continues this 
process, one obtains the graph of the single chain {1, 5, 4, 3, 8, 12, 7} 
as it appears in an unfolding. See figure 9 (B). To determinate if there 
is overlapping at this single chain, it suffices to calculate the distance 
between the center of pentagon 7 to the origin of coordinates. If this 
distance is greater than twice the radius of the circle circumscribing 
pentagon 1, then there is no overlapping. 

The algorithm gives as output in the case of this single chain the 
list L = {5', 4', 2', 4', 1', 5'}, which gives the vectors linking the centers 
of neighboring pentagons in the single chain. See figures 4 and 9. 
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then 



Algorithm 3.1: Single Chain Analysis(£, S, D) 

Begin 

comment: Read a single chain S with elements S[l], . . . , S[£] 

S <— Read 

comment: The first direction is always 5', and determines the 

orientation of S[2] = 5 

l = m 

comment: Orient pentagon S[3] 

if S[3] = 4 

(Pent[3] = {(g,2'), (o,l'), (p,5'), (MO, (c,3')} 
[Append direction 2' to L 
else if 5[3] = 10 

then { Pent $\ =.{(P> 2 ')> K 1 ')- (7,5'), OM'), (g,3')> 
[Append direction 3' to L 

end if 

comment: Append directions to L of other pentagons in S 

for v\ = 4 to v\ = £ 

do £l = %!] 
comment: Determine <5[i^i] PI S'f^i — 1] from data matrix D 

pi := %i] n %i - 1] 

comment: Save direction assigned to p\ previously 
L <— Store direction assigned to p\ 

comment: Assign orientation to pentagon £i compatible 

with orientation at previous face 
Pent[vi) = {(Perm(l),l'), (Perm(2),2'), (Perm(3),3'), 
(Perm(4),4'), (Perm(5), 5')} 
end for 
Return; 
end program 
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Keywords: Wavelets, approximate identities, zonal functions 
Wavelets are used to split up complicate signals into simpler parts reflecting 
different scales at different positions. The investigation of crystalline structures 
motivates studying wavelets on the three-dimensional sphere (S 3 ). 

1 Introduction 

It is an interesting and though task to construct wavelets for a sphere, because there is no 
naive approach that works. The biggest obstacle seemed to be to define a dilation. There 
are several approaches and the most successful ones are those by the group of W. Free- 
den (see for example [8]) and those of J. P. Antoine and P. Vandergheynst ([1], [2]). The 
approach by J. P. Antoine and P. Vandergheynst is a group theoretical approach where 
as the approach by W. Freeden is done by singular integrals and zonal functions. As it is 
mentioned in [3] both approaches are some how equivalent, even though a deeper study of 
the connection of both approaches seems to be missing up to now. We are interested in 
wavelets on the 3 dimensional sphere, which should be motivated by the following problem. 
In texture analysis, i.e. the analysis of preferred crystallographic orientation, the orienta- 
tion probability density function / representing the probability law of random orientations 
of crystal grains by volume is a major issue. In X-ray or neutron diffraction experiments 
spherical intensity distributions are measured which can be interpreted in terms of spher- 
ical probability distributions of distinguished crystallographic axes. In texture analysis, 
they are referred to as pole probability density functions. In general, if / is the orientation 
probability density function of a random rotation, then the spherical Radon transform IZf 
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integrating / over all 1-dimensional great circles C C 5 3 is simply the probability density 
function of g G C \ C C 5 3 . In this way the spherical Radon transform provides an appro- 
priate model of the X-ray diffraction experiment of texture analysis. 
To investigate the localisation properties of this spherical Radon transform we need wavelets 
on the 3-sphere. Our approach will use singular integrals and zonal functions. 



2 Preliminaries 

2.1 Surface hyperspherical harmonics 

If R 4 is parameterized in polar co-ordinates: 

x\ = r sin 9 3 sin #2 cos6\, 
X2 = r sin 8 3 sin ^ 2 sin#i, 
x 3 = r sin 9 3 cos# 2 , 
Xi = r cos #3, 

where < 6 2 , 3 < it and < 8± < 2ir, then the hyperspherical harmonics on S* 3 are 
defined by 



r klm = {2i) l i\ / 2 ( fc + 0(^ J-y- Ylm (e 1 , e 2 )c l k + _\(cos$ 3 ) sm l e 3 , 

y ir(k + 1 + 1)1 

where Yi m ($i, 82) are the well-known spherical harmonics on S 2 . In these co-ordinates, the 
(not normalized) SO (4)-invariant measure on S 3 reads 

da = sin 82 sin 2 8 3 dd\ dd 2 dd 3 . 

The basis {Y^im, < I < k, — I < m < 1} is in fact based on the reduction of the 
representation of 5*0(4) to representations of 5*0(3); each Yu m is an eigenfunction of an 
50(3) subgroup of 50(4) which leaves a selected point of 5 3 invariant. 

2.2 Function spaces 

We have for 1 < p < 00, the Lebesgue space 

L P (S 3 ) = lf:\\f\\ P =(-^ [ \f(x^ds(x^ 1 ' 



2tt 



s* 



and the weighted Lebesgue space for zonal functions, i.e. functions that depend only on 
the angle 9 and with t = cos 8: 
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2.3 Funk-Hecke theorem 

The ultraspherical or Gegenbauer polynomials {C*(£)} (identical to the Chebyshev poly- 
nomials of second kind U n ) build a complete orthonormal system on [— 1, 1] with weight 
y/l — t 2 . We denote by x ■ y = cos Z(x, y) =: t the scalar product of x and y in R 4 . 

Lemma 2.1. Let Y^(x), I G {1, 2, . . . , (n + l) 2 }, be an orthonormal basis of spherical 
harmonics of degree I. Furthermore, let 



q(/):= / f(y)Yj l (y)da(y). 
Js 3 

Then: 

N (n+1) 2 

E E %(mw 

1=0 1=1 

tends to f(x) as N — > oo, uniformly in x G S* 3 . 

Proof. The proof can be found in [10] or [12]. An immediate consequence is that the set 
of all spherical harmonics is dense in C(S 3 ) and hence in all spaces L P (S 3 ), 1 < p < oo. 
It should also be mentioned a remarkable result: Given 1 < p < oo, p ^ 2, there exists an 
/ G L P (S 3 ) such that the partial sums of the Laplace series for / do not converge in the 
L p -norm! (see [5] and mentioned in [10].) This is equivalent to the same result for Fourier 
series. 

Theorem 2.2. (Addition theorem) Let {Y^x); I — 1, 2, ..., (n + 1) 2 } be the set of (n+1) 2 
linear independent spherical harmonics of degree n and {Y^(x)} orthonormal on S 3 , then 

2 , ("+ 1 ) 2 
Cl(x .y) = — -r E Y n(*) Y n(y), x, y e S 3 - 

n + 1 1=1 

A very important theorem is 
Theorem 2.3. (Funk-Hecke) Let f G L\[-l, 1], then 

f{x ■ y) Y n (x) da(x) = Y n {y)^- [ f(t)C*(t)VT=Pdt. 

S 3 n + 1 J_ x 

Especially for n = we obtain 

f(x ■ y) da{x) = An f(t)y/l-t 2 dt. 

S3 J-l 

The spherical harmonics {Y^(x), I = 1, 2, . . . , (n + l) 2 } build a complete orthonormal 
system on S 3 in C(S 3 ) and L P (S 3 ), 1 < p < oo. Let / G L^S* 3 ) then / can be expanded 
into a Laplace series of spherical harmonics, we have 



S(f;x)~J2yn(f; 



x) 

n=0 
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where Y n (f; x) is given by 



n + 1 r 

Y n (f;x) = ^ r C 1 n (x-y)f(y)da(y), n = 0, 1, 2, . . . . (1) 

l S 3 



2tt 2 



and if / is zonal, i.e. depends only on the scalar product x • y — t, we have 

W; t) = f(n)C^t), where /(n) = - / fit) C x n (t) VT^¥ dt. (2) 



IX 



3 Spherical singular integrals 

The properties of the convolutions, i.e. Young's inequalities for groups and homogeneous 
spaces can be found in the appendix. An important type of singular integrals are singular 
integrals of convolution type which are generated by a singular kernel. This technique has 
been introduced in the Euclidean space by Mikhlin and ProBdorf [11], earlier by Calderon 
and Zygmund [6], and had been extended to spheres by Dunkl [7] and Butzer [4]. The 
spherical convolution is based on the convolution with zonal functions. For the sphere S 3 
this means following [4] 

Definition 3.1. For / G L 1 (S' 3 ) and g G L\[— 1, 1] the convolution h — f * g of / and g is 
defined by 

h ( x ) = 7^ f(y)g(x-y)dcr(y). 

2?^ Js : i 
The convolution has the following properties: 

Remark 3.2. Let be f G L P (S 3 ) and g G L\, 1 < p, q < oo, then is the convolution f * g 
is defined almost everywhere on S 3 and we have Young's inequality (see for example [13]): 

||/*<7||r<||/|| P ||<7lU - = - + --l>0, 

r q p 

in particular 

\\f * g\\ P < \\f\\ P \\g\\i,i and ||/*0||, < ||/||i||ff||g,i- 

The Laplace series expansion of f * g has the form 

Y n (f *g;x) = — — g(n)Y n (f; x), 
n + 1 

where g(n) and Y n (f;x) are given by (2) and (1) respectively. 

Based on this spherical convolution we are now able to define singular integrals on the 
unit sphere. 
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Definition 3.3. Let K^ G L\\— 1, 1], with h G (0, 1), be a family of kernels such that the 
coefficients satisfy 

K h (0) = - f K h (t)Cl(t)VT^t 2 dt = 1. 
n J-i 

Then the family 



W) =K h *f=^ I f(y)K h (. ■ y) da(y) 

I S3 



2tt 2 



is called a spherical singular integral while the family Kh, < h < 1, is called its kernel. 
A singular integral Ih is said to be an approximate identity in L P (S 3 ), 1 < p < oo, if 



lim \\I h f-f\\ p = 0. 

h — >1 



Remark 3.4. In [4] the convolution integrals defined above called spherical singular inte- 
grals. 

Theorem 3.5. Let Kh G L\[— 1, 1], with h G (0, 1) be a family of kernels such that 

1. it is a kernel of a spherical convolution integral 1^, that is, it satisfy 

1 r 1 

K h (0) = ^l K h (t)C^(t)VT^¥dt = 1; 

2. there exists a constant M > 1 such that for all h G (0, 1), we have 

-^ / |^(0| VT^dt<M; 
2vr 2 i 5 3 

5. /or every fixed S > 0, 

lim sup |-Kh(£)| = 0. 
fr^i~ -l<t<l-<5 

T/jen /or every / G L P (S 3 ), 1 < p < oo, £/ie convolution integral 

W) = K h *f = ^J^ f(y)K h (-; y) da(y) 

fulfills 

a) ||4/|| P <M||/|| p ; 

b) lim^ 1 -||4/-/|| p = 0. 
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Proof. Proposition a) is a consequence of Young's inequality and (2.). To prove the second 
proposition, we define 

S(x, 5) = {yeS 3 : x-y<l-5,5>0}. 

Then 



I h (f,x)-f(x) = ±- 2 \ K h (x-y)[f(y)-f(x)]da(y) 

A7r JS 3 \S(x,S) 

+-^ / K h (x-y)[f(y)-f(x)]da(y) 

= h+h- 

For Ji, we use the fact 

i \K h {x ■ y)\da(y) = 2vr f \K h (t)\ (1 - t 2 ) dt. 

By Young's inequality, Holder-Minkowki's inequality and (3.) we have that a given e > 0, 
there exists ho(e) such that 



O77- r 1 

;/,H, < — 2 2\\f\\ P J \K h (t)\Vi=Pdt 



2tt 2 ' 



< -||/|| p sup \K h (t)\<e, 

71 {— 1<*<1— <5} 

for h > h (e). 
Further, we get 

||/ 2 || P <^ / \K h (x-y)\\\f{y)-f{x)\\do-{y)<e-M, 

^ JS{x,S) 

if h < 1 — 8, due to Kolmogorov's compactness theorem (see for example [14] for L p , the 
one-point set {/} being relatively compact. Hence, 

||/ h /-/||p<||/i|| p +||/ 2 ||p<(M + l)e, 

for h < ho(e). D 

Lemma 3.6. Assume that the kernel {Kh}o<h<i is uniformly bounded in the L\-norm, i.e. 
there exists a positive constant M (independent of h) such that 

I \K h {t)Wl-t 2 dt< M (3) 
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for all < h < 1. Then the corresponding convolution integral {1^} is an approximate 
identity in L P (S 3 ), 1 < p < oo, if and only if 



lim Kh(n) 

h->l- 



n+l 



(4) 



for all non-negative integers n. 

Proof. For / G C(S 3 ) we have the Laplace expansion 



/(*) =£>„(/, 



X 



and we know that 



(f*K h )(x) 



n=0 



oo _. 



^ n + 1 

n=0 



K h (n)Y n (f,x) 



Let Pj be a spherical polynomial, e.g. a finit linear combination of spherical harmonics. 
From (4) we get that IhPi converges to P; for any spherical polynomial P\[x) in the L p -norm 
because in this case the Laplace series of IhPk has only finite many summands. Because 
the set of all spherical polynomials is dense in L P (S 3 ) (see Lemma 2.1) the convergency in 
the L p -norm follows from the Hahn-Banach theorem (density argument). 

Typical examples are the Abel-Poisson integral and the GauB-WeierstraB integral. 



4 Continuous Wavelet Transform 

4.1 Linear Theory 

We define spherical wavelets of order m — — 1. 

Definition 4.1. Assume that \x : [0, oo) — ► R + is a positive weight function. Let {^ p , p € 
(0, oo)}, be a subfamily of L\[— 1, 1] such that the following admissibility conditions are 
satisfied: 



for n = 0, 1, 



* p (n) n(p) dp = n + 1, 



(5) 



for all R E (0, oo) 



V p {t)n{p)dp 



R 



Vl - t 2 dt <T, 



(6) 



where T is a positive constant independent of R. 
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Then {^ p } is called a spherical (linear) wavelet of order — 1. The function ^ = ^ is called 
a spherical mother wavelet. The associated (linear) wavelet transform is defined by 



(WT%n(p. ,j) := ^ [ V p (x-y)F(x)da(x) 

's 3 



2vr 2 



for all F G L 2 (S 3 ). 

Then 

Theorem 4.2. (Reconstruction formula). Let {^ p } be a wavelet (of order — 1). Then 
F G L 2 (S 3 ) can be reconstructed in L 2 -sense by 

F(y) = J (WT)(p, y) fi(p) dp = J — J ^ ^ p {x ■ y)F(x) da(x)fi(p) dp. 

Proof: Let R be a positive number. Then 

(WT)(F)(p, y )^p)dp=^- 2 [ [ y p (x-y)F(x)do-(x)fi(p)dp 

R 27r JR J S* 



\ f $ R {x-y)F(x)do-(x) = {$ R *F){y), 



where 



2tt 






poo °° 1 

= / J2~ -r^ P (n)Y n (F;y)p J (p)dp 

because of (3) and (1). Hence 

^^(n)/i(p)d/)C7^.j/). 

- E 71=0 

According to our construction the kernels $# G Lf[— 1, 1] for all R > (cf. [4]) and due 
to (5) 

lim & R (n) = n + 1 Vn eN . 

R-^0+ 

From 

/•OO °° POO 

**(*) = / E *» c «w ^) ^ = / *p(0 M d P 

JR n=0 ./.R 
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and (6) we deduce that the kernel {$_r} is uniformly bounded in the sense of (3) and thus 

1 f °° n+ 1 

\im + — J 3 $ R (x ■ y) F(y) da(y) = £ "^iT W *) = F ( x )- D 



?1=0 



Remark 4.3. Analogously to [8] there would be another condition: 
for R G (0, oo) 



J2(n + 1) i>{n)n{p)dp 

n=0 J R 



< OO. 



This condition is needed for the non-negativity of the kernel which is important for the 
approximation properties but neither for the definition of wavelets nor the reconstruction 
formula. 

Remark 4.4. The defining properties of a spherical wavelet of order —1 do not presume the 
zero-mean property of ^/ p (or equivalently ^/ p (0) = 0). This coincides with the observation 
in [8] for the sphere S 2 and in [1] that the zero-mean condition is not a necessary condition. 
Here, we have a substantial difference to the classical (Euclidean) wavelet concept on the 
real line. 

Definition 4.5. Let {^ p }, p £ (0, oo), be a subfamily of L\[— 1, 1] such that the admis- 
sibility conditions are satisfied: 

• for n — 1, 2, . . . 



for p G (0, oo) 
for all R G (0, oo) 



* p (n) p(p) dp = ra + l, 



%(0) = o, 



(7) 



* P (t)(*(p)dp 



R 



Vl - t 2 dt <T, 



(8) 



where T is a positive constant independent of R. 



Then {^ p } is called a spherical (linear) wavelet of order 0. The function ^ = ^i is called 
a spherical mother wavelet and the associated wavelet transform is defined by 

{WT)(F)(p;y) = (* p (y),F)„ m 



for all F G L 2 {S 3 ). 
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Theorem 4.6. Let {^ p } be a wavelet of order 0. Suppose that F G L 2 satisfies Y (F; x) = 
0. Then 

F ( x ) = A r(WT)(F)(p, x)p{p)dp. 
^ Jo 
Proof: Let R be a positive number. Then 

(WT)(F)(p, y )^p)dp=^- 2 [ [ y p (x-y)F(x)do-(x)p(p)dp 
r 2tH J r J S 3 



$ R (x • y) F(x) da(x) = ($« * F)(y), 

s 3 



where 



OO l> POD 



WJ / *p( a: --V) /,, (-0^(.0/'(^)^= / (*,,* F)(u) tt(f>) dp 



R JS 3 JR 

OO 



poo _ 1 

— / ^* P (n) / C 1 n {x-y)F{x)da{x)p J {p)dp 



2 
because of (3) and (1). Hence we can choose 

$ R (x-y) = l+ S^i! p {n)p{p)dpC 1 n {x-y). 
J R „=i 

According to our construction the kernels $r G Lf[— 1, 1] for all R > (cf. [4]) and due 
to (7) 



lim $ R (n) = n + 1 Vn G N . 



From 



/OO "" /"OO 

^ £» C*(t) p{p) dp = l+ V„(t) p(p) dp 
- l n=l J R 

and (8) we deduce that the kernel {$_r} is uniformly bounded in the sense of (3) and thus 

If °° n+ 1 

\im + — / g $ fi (:r • y) F(y) d<r(y) = £ -^r W *) = ^). □ 

" " n=i 

Remark 4.7. The linear analysis here may be formally interpreted as bilinear analysis: 



F(x)= / / (WT)(F)(p;y)S(yx)da(y)fi(p)dp, 

Jo Js 3 

where 

OO 

5{x-y) = Y^{n+l)C l n {x-y) 

n=0 

is the Dirac distribution. 
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4.2 Bilinear Theory 

The concept of dilation and translation becomes more evident in bilinear theory. 

Definition 4.8. Let \x : [0, oo) — ► R + be a positive weight function and {^ p , p G (0, oo)}, 
be a subfamily of Lf[—1, 1] such that the following admissibility conditions are satisfied: 

• for n — m, m + 1, ... 

/■oo 

i> p {n)p(p)dp = (n + l)\ (9) 



o 



for n = 0, 1, . . . , m, and allp G (0, oo) 

* P (n) = 0, (10) 

for all R G (0, oo) 



{y P *y P ){t)n{p)d P 

R 



Vl-t 2 dt < T, (11) 



where T is a positive constant independent of R. 

Then {^ p } is called a spherical wavelet of order m. The function ^ = ^i is called a 
spherical mother wavelet. The associated (linear) wavelet transform is defined by 



(WT)(F)(p,y) := ^ J * p ( x ■ y)F(x) da(x) 



2tt 2 



for all F G L 2 (S 3 ). 

Then 

Theorem 4.9. (Reconstruction formula) . Let {ty p } be a wavelet (of order m). The wavelet 
transform is invertible on the range of all functions F G L 2 (S 3 ) with F(k,i) = for 
k — 0, 1, . . . , m and i — 1, . . . , (k + l) 2 , in L 2 -sense by 



F (y) = ^ii I (WT)(p,y)^ p (x-y)p(p)dpda(x). 
>s 3 Jo 



2tt 2 
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Proof: Let R be a positive number. Then 

^JJ (WT)(p,y)^ p (x-y)p(p)dpda(x) 

= ^/ / ^~2 I ^ p (x-z)F(z)da(z)^ p (x-y)p(p)dpda(x) 
2tH J ss Jo 2tH J S 3 

= ^2 / 3 / 2^2/3 *^ x ' z ^p( x ' ^) ^) d P da ( x ) F ( z ) da ( z ) 

e(yz) 

= 2W / E*W/ C7*(«-«)C7 fc 1 (a;.y)Ai(p)dpdcr(a;)F(«)dtT(«) 



1 /* _°°_ 1 Z* 00 

= ^-2j s3 J2k^J R K m (p) d P°K x ■ Z ) F ^ Z ) M z ) 

= (G R *F)(y). 
Hence 

and uniformly bounded due to (11), further because of (9) we have 

lim Q R (k) = k + 1, for all fc = m + 1, m + 2, . . . . (12) 

Hence by (11) and (12) {Or, R > 0} is the kernel of an approximate identity and thus 

hm(Q R *F)(y) = F(y) 

H — >U 

in the L 2 (S' 3 )-norm. □ 

We can rewrite the reconstruction formula as: 

pod 

F= / (y p *y p *F)(.)p(p)dp. 
Jo 

In the bilinear theory reasonable to introduce a scaling-function. 

Definition 4.10. The corresponding scaling-function {$«, R > 0} for a family of wavelets 
{\P p , p > 0} of order m is defined by 

{fc + 1, k — 0, 1, . . . , m, 

(ln^ P (k)Mdp)\ k = m + l,.... 
It can be shown that that the scaling function $# belongs to L 2 (S 3 ) and that 



lim ($ R * $ R * F) (y) = lim (G R * F) (y) = F(y). 

R — >0 R — >0 



For details see [9]. 



122 



BERNSTEIN-EBERT: ON WAVELETS 



5 Wavelets 

The wavelets of order m corresponding to the Gau-Weierstra kernel look as follows: 



* 



H 



0, A; = 0,1, 

i 



(2k(k + l) 2 (k + 2)e- 2k{ - k+ ^ R ^-\R)) 2 , k = m + 1, m + 2, . . . . 
Those corresponding to the Abel-Poisson kernel are 



* 



fc = 0, 1 



, J-, • • • , " V-j 



m, 



R 



(2k(k + lfe- 2kR n~ 1 {R)) 2 , k = m + l,m + 2, 



We plot Abel-Poisson wavelets of order and choose /x(p) = -?. It is not so simple to 
visualize wavelets on the 3D sphere, which is a subset of 1R 4 . To overcome the dimensional 
problem we consider only the upper half sphere 

4 

S 1 ^ := {(xi, x 2 , x 3 , £4) G M 4 : /^a^ = 1, and Xi > 0}, 



which can be identified with the unit ball in M 3 by 



E 



•r? 



E 

j=2 



.?',■ 



1 - Xl < 1 



For a fixed x we obtain a 2D sphere in M 3 with radius y/1 — x 2 . The union of all spheres 
with < x\ < lis the unit ball in M. 3 . Abel-Poisson wavelet with p = 0.2 on the slices 
x = 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 : 












ni:::i 
t ~r 3 

rr 





-IQ- 
-2>- 
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It is easily seen that the wavelets localization is good, small positive and negative values 
are due to the fact that p = 0.2. 

In the next figures the influence of p is demonstrated by the slice X\ = for varying values 
of p. The delta peak is very good for small p and expands for larger values of p. Abel- 
Poisson wavelet with p = 0...3 on the slice x = 
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Abstract 

We survey the properties of ideal projectors and structure of the family 
of ideal projectors onto a given finite-dimensional space of polynomials. 
Particularly we establish relations between ideal projectors, commuting 
matrices, zero-dimensional ideals, solutions of systems of PDEs and cer- 
tain topics in algebraic geometry, such as Hilbert and border schemes. 



1 Menu 

As the title indicates, this survey does not offer a full course on ideal interpo- 
lation but rather, following the culinary analogy, a sample of what is available 
in this exiting area of multivariate interpolation. Although many of the results 
presented here are true for the real field as well as the complex field, I will 
limit myself to working in the space C[x] := C[xi, . . . ,Xd\ of polynomials in d 
variables with complex coefficients. 

Definition 1.1 ([BiJ) A linear idempotent operator P : C[x] — ► C[x] is called 
an ideal projector ifkerP is an ideal in C[x]. 

Lagrange interpolation projectors, Taylor projectors and, in one variable, 
Hermite interpolation projectors are all examples of ideal projectors. For this 
reason the ideal interpolation holds a promise of elegant theory of multivariate 
extensions of univariate properties, which as a rule tend to be a messy subject. 
The brilliance of Birkhoff 's idea is in restricting the domain of the projectors 
to the space (ring, algebra) of polynomials C[x] thus allowing a whole slue of 
various mathematics to come into play. As luck would have it, the problems in 
ideal interpolations are closely related to problems in commutative and linear 
algebra, algebraic geometry and PDEs. Here is what on the menu: 
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1.1 Approximation Theory (AT) 

The main objective of this paper is to study ideal projector onto a fixed TV- 
dimensional subspace G c C[x]. We will denote the family of all such projectors 
by CPg- By the end of this paper we will get to a remarkable fact: the structure 
(geometric, metric, topological) of tyc depends on G and differs for different 
spaces G of the same dimension. 

1.2 Commutative Algebra (CA) 

Every ideal projector P G *Pg determines a decomposition 

C[x] = J©G (1.1) 

with the ideal J = kerP. Thus studying *Pg is equivalent to studying the set 
-3g °f ideals in C[x] that complement G. Given an ideal J G Zg-> the space G 
spans the finite-dimensional quotient algebra C[x]/J. 

1.3 Algebraic Geometry (AG) 

Every ideal J G C[x] generates a subset 

Z(J) = {z G C d : /(z) = for all / G J} (1.2) 

which is called an affine algebraic set (variety, scheme). Given a basis q — 
(<7i, ,5jv) of G we will construct an affine algebraic set, called the border scheme 
B s that parametrizes <Pg (equivalently 3g) in a natural (continuous way). 

1.4 Linear Algebra (LA) 

A sequence of L = (L 1; . . . , L^) of commuting linear operators on G is called 
cyclic if there exists a vector go G G such that 

{/(L 1 ,...,L d ) 5o :/GC[x]} = G (1.3) 

The vector go is called a cyclic vector for L. Let Zq stand for the family of all 
such sequences. With every P G tya {J € 3g) wc w iU associate an L g£g and 
vise versa. 

1.5 Duality (PDE) 

Let g = (<7i, . . . , g^) be the linear basis for G C C[x]. Every finite-dimensional 
projector P on C[x], ideal or not, can be written as 

P = Y,9k®F k (1.4) 

where (Fk,k = 1 : N) consists of functionals in C'[x] dual to (gk,k = 1 : N), 
i.e., (F,gj) = Sj t k- The space 
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span {F k } = ran P* = (kerP)- 1 (1.5) 

is correct for G, thus for every P G ran P* we have F(f) — F{Pf) for all 
/ € C[x]. In other words P "interpolates" the functionals F € (kerP)- 1 . We 
will identify the functionals in C'[x] with formal power series in C[[x]] and show 
that the ideal projectors correspond exactly to P-invariant subspaces of C[[x]]. 
These subspaces are precisely the spaces of solution of homogeneous systems of 
PDEs with constant coefficients, hence the acronym. 

1.6 Putting it all together 

In this paper we will explain interrelationship between the notions mentioned 
above, present result and some open problems that are based on intricate in- 
terplay between these diverse fields. To keep the size of the paper reasonable I 
had to make a choice between rigorous proofs and accessibility of the material 
to a reader not terribly familiar with algebraic geometry (such as myself). I 
chose the latter. Thus many of the proofs are going to be only highlighted (with 
detailed references to the original papers). Instead I will attempt to illustrate 
the results with generous serving of concrete examples. Bon appetit! 



2 Antipasto 

2.1 de Boor's equation 

An obvious property of multiplication on C[x]/J: 

[/[<?]] = \f 9} (2-1) 

translates into the following characterization of ideal projectors : 

Theorem 2.1 (Carl de Boor [Bol])A linear operator P : C[x] — > C[x] is an 
ideal projector if and only if 

P(l9) = P(f ■ P(g)) (2-2) 

for all f,g € C[x]. 

This theorem implies that unlike an arbitrary projector onto G, an ideal 
projector is completely determined by its values on a small set of polynomials. 
This makes sense. Imagine that you know that P is a Lagrange interpolation 
projector onto the space of polynomial of degree less than N in one variable. 
Knowing Px N we can form a polynomial x N — Px N which has exactly N zeroes. 
These zeroes are the interpolation nodes (sites) for P, thus Px N determines P. 
In algebraic term the ideal (x N — Px N ) generated by x N — Px N is the kernel 
of P. 



128 SHEKHTMAN: IDEAL PROJECTORS 



2.2 The border bases 

Here is the general situation: 

Let g = (<?i, . . . , gjy) be a linear basis for G. We define the border of g to be 

dg--{l,Xig k , i = l,...,d, k = l,...,N}\G. (2.3) 

For every J £ 3q, the decomposition (1.1) induces an ideal projector Pj 
onto G with ker Pj = J. From (1.1) it follows that for every ideal J £ 3c and 
for every b £ dg there exists a unique (!) polynomial pb = Pjb £ G such that 
b — pb € J. As it turns out, the set {b — pb, b £ dg} forms an ideal basis for J, 
called a (generalized) border basis. 

Proposition 2.2 Let J £ 3g a nd for every b £ dg let pb := Pjb be the unique 
polynomial in G such that b — pb £ J. Then 

(i){b — pb, b £ dg} forms an ideal basis for J. 

(ii) If Pf — 2_, a a(f)x a £ G then the coefficients a a (f) are polynomials 

in the coefficients of polynomials {Pb, b £ dg}. 

The simple proof of (i) can be found in [Bol] and equally simple proof for 
(ii) is in [S5]. 

What about a converse? That is, what polynomials (pb, b € dg) C G have 
the property that the ideal {b — pb,b € dg) is in 3^? This is the question first 
dealt with in [Mo] with some additional assumptions on G (cf. also [Bo3] and 
[KR, 6.4B], ). The extension of their results is presented below. 

Mimicking the terminology of [KR, 6.4B], we will characterize those border 
prebases that are border bases. We will present necessary and sufficient condi- 
tions on polynomials {pb, b £ dg} for {b — pb,b e dg} to be a basis for an ideal 
in Jq. As in [KR, 6.4B], the criterion involves formal multiplication operators 
Mj :G^G defined by 

M l9k = I X ^ '% X * 9k % G r (2.4) 

I Px i9k if Xig k £G 

Here is the main theorem of this section: 

Theorem 2.3 Let (pb,b G dg) be a sequence of polynomials in G. Then the 
ideal (/ — Pf,f £ dg) £ 3q if and only if 

(i) MiM k = M k Mi for alli,k = 1, . . . ,d, 

(ii) g{M 1 , . . . , M d ) Pl = g for all g £ G. 

Proof. First assume that J = (b — pb, f £ dg) £ 3 a and let Pj be the ideal 
projector onto G with ker Pj = J. Then Mig = Pj(xig) for all i = 1, . . . , d. It 
follows from (2.2) that 

MjM k (g) = Pj(x t Pj(x k g)) = Pj(x t x k g) = P.j{x k x t g) = M k Mi(g) 
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which proves (i). Also observe that if g = J] a tt x a , then, for M := (Mi, . . . , M4) 
and go := PI, we have 

5 (Mi,...,M d )<7 = g = ^a a M a (Pjl) = ^2a a Pj(x a (Pjl)) 

= X! a «^( x ") = P ^(E «<**") = P ^ = 9 

which proves (ii). 

Now, suppose that (i) and (ii) holds. Then the mapping ip : C[x] — > C[x] 
defined by 

y./ - /(M) Pl 

is a ring homomorphism, hence it kernel 

K := ker p = {/ G C[x] : /(M)pi = 0} by i U) {/ G C[x] : /(M) = 0} 

is an ideal in C[x]. By (ii) the range of tp is G and K n G = 0. By the funda- 
mental theorem of homomorphisms C[x]/_ftT is isomorphic to G. In particular 
codimension of K is equal to dim G and if complements G. 

Let /ib be the unique element in G such that b — hb E K. We need to show 
that J — K or, alternatively that /i^ = gt, for every 6 G dg. Since b — hb G K 
we have 

= (6(M) - h b (M)) Pl by i U) b(M) Pl - h b 

On the other hand, by definition of M, we have 6(M) Pl = p b which implies that 
Pb = hb for all b G <9g. ■ 

Remark 2.4 If G is a D '-invariant subspaces of k[x] spanned by monomials, 
then 1 G G and, oy i/ie D-invariance, the condition (ii) of the Theorem 2.3 is 
automatically satisfied (see example section 3). Hence the theorem 2.3 gener- 
alizes theorem 6.4-30 of [KRJ with, what seems to be, a shorter, simpler proof, 
courtesy of the language of ideal projectors. 

2.3 The border scheme 

The operators Mi , . . . , Mk can be written as N x N matrices in the basis g and, 
the polynomial p\ G G generates an N x 1 matrix of its coefficients. 

Definition 2.5 The affine scheme B s defined by the ideal I B generated by the 
entries of the matrices MjMi — MiMj, i,j — 1, . . . , d and the coordinates of the 
vector pi : 

{g k (M 1 , ..., M d ) Pl -g k ),k = l,...,N (2.5) 

is called the generalized border scheme for g or Q-border scheme. It parametrizes 
the family of ideals 3q or, equivalently, the family of ideal projectors ^q . 

Proposition 2.6 It is clear from construction ideal projector P G *Pg (ideals 
J G 3g) are i n one-to-one correspondence with the points in B g . 

Thus we will sometimes refer to P <G *Pg (J € 3g) as a point P G *Pg- 
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3 Aperitif 



In this section we will illustrate how the notions of the previous sections apply 
for concrete spaces G. About the easiest multivariate space one can find, is 
the space of linear function in C[x,y]. As you will see, even in this case the 
computations are quite involved. Nevertheless, they are worth going through. 
Once accustomed to it, this example acts like a Led Zeppelin record: when 
played backwards, it sends you messages about the general theory. (For the 
univariate theory see [SI]). 

We will now attempt to determine all ideal projectors onto the three dimen- 
sional subspace G C C[x, y] spanned by its basis g = (l,x,y). 

By the theorem 2.3, to describe ideal projectors P onto G we only need 
know the values of P on the set of monomials x 2 , xy and y 2 . In other words the 
polynomials x 2 — Px 2 , xy — P(xy) and y 2 — Py 2 will form an ideal basis for the 
ideal ker P. 

Assume that P is an ideal projector onto G and 



Px 2 = a + b x + c y, 
Pxy = a\ + b\x + ciy, 
Py 2 = a 2 + b 2 x + c 2 y. 



(3.1) 



We need to find conditions on nine coefficients (do, Oi, a 2 , &0i b\, b 2 , Co, C\, c 2 ) that 
guarantee that the ideal 

(x 2 - Px 2 , xy - Pxy, y 2 - Py 2 ) 

complements Gl 

To answer this question we form formal multiplication matrices 



Mi 



ciq a\ 

1 b h 
c c\ 



;M 2 



a\ a 2 

1 &i b 2 
Ci c 2 



(3.2) 



and use Theorem 2.3. For our choice of G, the conditions (ii) of the theorem is 
automatically satisfied. All that is left is to enforce the commutativity. The six 
quadratic equations obtained from MiM 2 — M 2 M 1 = arc 

(a h + oici) - (ai&o + a 2 c ) = 0, 

(oi + bobi + bici) - (boh + b 2 c ) = 0, 

(c\ + bico) - (a + b ci + c c 2 ) = 0, 

(aob 2 + 01C2) - (aih + a 2 ci) = 0, 

(02 + b Q b 2 + b lC2 ) - (b\ + feci) = 0, 

(62C0 + C1C2) - (01 + b\C\ + C1C2) = 0. 

A close examination reveals that there are a lot of redundancy among these 
equation. The solutions to these equations are given by 



a = -boci +cf + bic - c c 2 , 

ai = b 2 c - bici, 

a 2 = b\- c 2 b\ - b b 2 + b 2 c 1 . 



(3.3) 
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The border scheme B s is a six-dimensional variety affine variety in C 9 that 
consists of all nine-tuples 

(ao, oi, 02, &o, bi, b 2 , Co, ci, c 2 ) (3.4) 

satisfying (3.3). 

By checking (3.3) we see that the following four projectors define by 



T :Tx 2 = T(a;y) = Py 2 = 0, 
P* : P*:r 2 = y, P* (xy) = P«y 2 = 0, 
L : Lx 2 — x, L(xy) — 0, Ly 2 = y, 
P : P.x 2 = ffzy = Hy 2 = y 



(3.5) 



are in fact ideal projectors onto G. The first, T is the Taylor projector onto G, 
it interpolates / € k[#, y] and its first partial derivatives at 0. The dual space 
(kerP*)- 1 - is a P-invariant subspace of k[[x,y]] spanned by l,x,.x 2 + 2y. Thus 
P* also interpolates at zero various derivatives at zero, namely 

So,S oD x ,S o(D 2 x + 2D y ) 

and is a different projector. Hence, unlike the case in one variable there are two 
(infinitely many) ideal projectors onto G such that Z(kerP) = {0}. The projec- 
tor L is a Lagrange projector interpolating at sites (0, 0), (1, 0) and (0, 1). The 
dual space (ker L) 1 - is a span of the power series expansion of three distinct ex- 
ponential functions: 1 — e°, e x and e v . Finally the last projector H interpolates 
at zero, at (f , 1) and the derivative D x at zero, (ker H) 1 - — span{e° , xe° , e x+y }. 

4 Soup and salad 

4.1 Cyclic commuting matrices 

Given P e CPg and i = 1 : d, we define multiplication operators Mi : G — > G by 

Mi(g) = P( Xi g). (4.1) 

These operators are similar (literally and figuratively) to the multiplication 
maps raj on k[x]/J defined by rrij([f]) :— [xjf] £ k[x]/J for every [/] € k[x]/J. 
A relationship between ideals, multiplication maps and numerical analysis was 
initiated and explored by H. Stctter [St]. Observe that these operators are 
precisely the operators defined in (2.4) with p Xi g k = P{xigk)- Therefore the 
sequence Mp := (Ap, . . . , Mj) is a cyclic commuting sequence with the cyclic 
vector g := P(f). 

Nearly all the information about P can be read off the sequence Mp: 

Proposition 4.1 Let if : C[x] — ► C[x] be defined by ip(f) = f(M.p)go & C[x]. 
Then 

(i) kcrP = ker^ = {/ G C[x] : f(M P )g = 0} = {/ G C[x] : /(M P ) = 0}. 

(ii) The restriction ip\ G of ip to G is an isomorphism on G. 

(tii) P = ^ |G J o f 
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The converse to this is also true (cf. [BS]): 

Theorem 4.2 Let L := {L\, . . . , Ld) be a cyclic sequence of commuting N x N 
matrices with a cyclic vector w G C N . Let ip : C[x] — ► C[x] be defined by 
¥>(/) - f(L)v G C N , let 

J L := {/ G C[x] : /(Mp) = 0} (4.2) 

Then and L is similar to the matrices of multiplication operators Mp of any 
ideal projector P with ker P = Jl . 

4.2 Duality and PDEs 

The space C[[x]] D C[x] is the space of all formal power series in d variables. A 
generic element / G C[[x]] is written as a formal sum 

/(x) = ^/(a)x Q (4.3) 

where a = (a l , . . . aj) runs through all multiindices in Zl and x a = x" 1 ... x^ d . 
Given / G C[[x]] and a sequence L = (L\, . . . ,Ld) of commuting operators on 
some linear space V , we define a formal operator /(L) on V to be 

f(L)v = J2f(<x)Lr---L a d d v. 

In particular if D t are operators of differentiation on V — C[x] with respect 
to the variable X\ and D := (Z? 1; . . . ,-Dd) then for every F G C[[x]] and every 
/ G C[x] the pairing 

(F,f) := (F(D)/)(0) = £«!>(<*)/(«) 

defines an isomorphism between C[[x]] and the algebraic dual C'[x] of C[x]. It 
is easy to see that 

(F, Xi f) = (DiF,f), (4.4) 

in other words Di on C[[x]] is an adjoint of the operator of multiplication by xi 
on C[x]. 

Example: It is easy (and insightful) to check that the point evaluation 
functional S z :S x (f) = /(z) on C[x] corresponds to the functional F G C[[x]] 
defined by the expansion of e z ' x in powers of x. 

The following theorem due to Macauley [Ma] follows immediately from (4.4): 

Theorem 4.3 A subspace J C C[x] is an ideal iff J 1 ' C C[[x]] is D -invariant, 
i.e., F G J^ implies DiF G J ± . 

Thus the study of Zg (hence Zg) is equivalent to the study of _D-invariant 
subspaces $ C C[[x]] that are correct for G. 

Example: Keeping in mind the example above, we conclude that the La- 
grange interpolation projectors are determined by subspaces of C[[x]] which are 
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spanned by pure exponentials: $ = span{e Zj x .j — 1 : N} while the Taylor 
projector is determined by a span of pure polynomials: $ = span{x. a : \a\ < 
n} =: C[x]<„. 

As the multiplication matrices are associated with the ideal projector, the 
adjoints of those matrices determine the rules of differentiating functions in 
(ker P) ± . Let J € 3q- For every P <G J ± we denote the restriction of F to G by 
F* . Since G complements J, it follows that J- 1 is an iV-dimensional subspace 
over G, hence the dual space G* is 



If Pj is the ideal projector onto G with ker P = J, then 



where q = (</i, . . . , g^) is a basis for G, and (Pi, 
J- 1 , i.e., Fj(gk) — Sj t k- The adjoint operator P* : (< 
Pj — J2 Fj 5j having J- 1 as its range. 

Theorem 4.4 Prf J e 3 G and let Mj = (M 1: 
multiplication operators: 



,F 



N, 



(4.5) 



(4.6) 

is the dual basis in 
C[[x]] is a projector 



, Mj) be the matrices of 



in the basis 



Fj(g k ) 



y,k- 



Then 



Mi(g) = Pjfag) 
, (/at). Let (Pi,...,P/v) be the dual basis in J- 1 , i.e., 

' Pi 1 [Fi " 

A : =M* : (4.7) 



" Pi " 


= M* 


" Pi " 


Fn 




Pjv 



Proof. For every jeGwe have F{M t g) = F*(M t g) = {M\F*){g). On the 
other hand 

P(M lff ) = P(P(x l5 )) = (P*P)(x l5 ) = P(x l5 ) = (AP)( ff ) = (DiF)*(g), 

where the third equality follows P <G J x =ran P*. Hence (MfF*) = (D,P)*. 

iW p* „,v, OTO ™ (1) is the j, /c-entry in the 



This means that (P^Pfc)* = ^rn Kl ' k F* , where nv- fe 



« 



matrix M*. On the other hand, by the 9-invariance, P^Pc = ^ a^ ^.Pj for some 



,W 



W 



coefficients or- [ . Since (P* ) is a basis in G* it follows that a- k 



l j,k- 



4.3 Primary ideals 

An ideal J d C[x] is called primary if fg € J implies f m G J for some m G N 
or g <G J. 

Theorem 4.5 Lei J be a zero- dimensional ideal in C[x]. T/ien £/ie following 
are equivalent: 

(CA) J is primary 



134 SHEKHTMAN: IDEAL PROJECTORS 



(G) Z(J) — {z}, i.e., Z(J) consists of one point 

(PDE) J- 1 = e z ' x M where M C C[x] is a d-invariant subspace of polynomials 

('■)■ 

(LA) a(M L ) = {z}. 

The equivalence of (CA) and (G) is standard (cf. [CL01], [BR]). The (PDE) 
was explored in [M] (cf.also [Bol]). The (LA) follows from [MSh]. 

4.4 Primary decomposition 

Definition 4.6 Let L := (L\, ...,Ld) be a d-tuple of operators on V. A direct 
sum decomposition 

v = Vi e v 2 e ... © v t (4.8) 

is Jj-invariant if each subspace Vk, k = 1, ...,£ is an invariant subspace for each 
of the operators Lj, j — 1, ..., d. 

Letting Lj^ :— Lj \y k denote the restriction of Lj onto Vk we write 

L fe =L|v fc :=(Li, fe ,...,£d,fc)- (4-9) 

The simultaneous block-diagonalization of L into t blocks amounts to noth- 
ing more then the L-invariant decomposition (4.8) of V: Indeed, for an appro- 
priately chosen bases, the matrix Lj of Lj can be written in a block-diagonal 
form 



Lj 



L jA ••• 
Lj-,2 ••• 

■■■Li 



(4.10) 



J 3,t 

A d-tuple A := (Ai . . . Xd) € C d is called an eigentuple of L if there exists 
a non-zero vector v G such that LjV — \jV for all j = 1 : d. The set of all 
eigentuples of L is called the joint spectrum of L and is denoted by c(L). The 
next proposition seems to be well-known among experts. For a cute proof we 
refer to [MSh]: 

Proposition 4.7 Let L be a d-tuple of pairwise commuting operators on V . 
Then 

(i)a(L)=Z(J Ij ). 

(ii) There exists an Li-invariant decomposition ofV: 

V = V 1 (BV 2 ®...®V #a{L) . (4.11) 

Adding the assumption that L is cyclic gives us more: 

Theorem 4.8 (cf. [MSh]) If L is cyclic then (4-11) is a unique maximal L- 
invariant decomposition ofV: 

V = ©A6«r(L)Vx, 
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i.e., the space V cannot be decomposed into more than #<r(L) of L- invariant 
subspaces. More over er(L|y A ) = {A}, hence consists of a single eigentuple. 

The reason is that if L is cyclic then eigenvectors for L in different Vk and 
Vj correspond to different cigentuples. 

Theorem 4.9 Let J be a zero-dimensional ideal with Z(J) = {zi,...z m }. 
Then 

(LA) There exists unique (up to the order) maximal Mj -invariant decom- 
position 

G = G'i©G 2 ©...©G* m (4.12) 

and a{M JlG .) = {zj}. 

(AT) The ideal projector Pj has a unique maximal decomposition as a sum 
of m ideal projectors: 

P = P 1 + ...P m (4.13) 

where each Pj is an ideal projector onto Gj interpolating at exactly one point 
and PkPj — dk.jPj- 

(CA: Lasker-Noether) There is unique (minimal with respect to containment) 
primary decomposition 

J = DJj (4.14) 

with each Jj is a primary ideal with Z(Jj) — {zj}. 

(PDE) The subspace J- 1 C C[[x]] has a unique maximal decomposition 

J- 1 = e Zl ' x ■ H x © • • • © e z ™' x • H m (4.15) 

where each Hj is a D '-invariant subspace of polynomials. 

The (CA) is the famous Lasker-Noether theorem, (PDE) was observed by 
[M] (cf. also [Bol], [BR]) and follows from (CA) since (4.14) implies J 1 - = ©J/. 
(LA) and (AT) arc from [MSh]. 

5 Appetizers 

5.1 Radical ideals 

An ideal J C C[x] is called a radical ideal if f m € J implies / € J. 

Theorem 5.1 Let J C C[x] be a zero-dimensional ideal. Than the following 
are equivalent: 

(A): J is a radical ideal. 

(G): #Z(J) =codimJ 

(PDE): J- 1 is a linear span of pure exponentials: J" 1 = span{e Zk '*,k — 1 : 

N} - 

(LA): The matrices (Mi . . . Mj) — Mj are simultaneously diagonalizable. 

Moreover, the diagonal elements consist of interpolation sites for the ideal pro- 
jector Pj . 

(AT): Pj is a Lagrange projector interpolating at sites {zi . . . zj/} 
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Proof. Again, equivalence of (A) and (G) is standard (cf. [CLOl],). (PDE) 
follows from Theorem 4.5. (LA) was first observed in [St], cf. also [Bol] and 
[BS]; but now follows from Theorem 4.8. ■ 

Example: The projector L from (3.5) is a Lagrange projector interpolating 
at sites (0,0), (1,0) and (0,1). The multiplication matrices for this projector 
are 



M L = 





1 1 








1 1 



that is the matrices of Ml are diagonalizable. J- 1 
is the span of pure exponential. 




1 








i 




1 


- = sp 


an{ 


1 = 


e°,e x ,e y ) 



} which 



Remark 5.2 The equivalence of (A) and (LA) of the theorem was first observed 
by Hans Stetter [St]. He also noticed that the joint spectrum (diagonal pairs) of 
Ml are precisely the interpolation sites, which also follows from the Theorem 
4-8. More over the eigenvectors corresponding to these eigentuple are basic 
Lagrange polynomials that vanish on all points but one. 

5.2 Curvilinear ideals 



An ideal J is called curvilinear if there exists a linear form X — N^ a j x j such 
that J complements the space span{l, X, ..., J" -1 }. 

Theorem 5.3 For an ideal J G 3g the following are equivalent: 

(AT): J is curvilinear 

d 

(LA): The matrix M :— ^^ctjMj is non-derogatory. 

(CA) Every ideal Jk in the primary decomposition J — n| =1 Jk is curvilinear. 
(PDE): If J 1 - — fe=1 (e Zfc ' x .fffc) then each H^ contains at most one linear 
polynomial. 



Proof. Let Gx '■= span{l, X, . . . l" -1 } and let Q be an ideal projector onto 
Gx with Mq = (Li, . . . , Ld) and define Lx := X) a i^i- By the Theorem 2.3, 
(1, Lxl, ■ ■ ■ , L x ~ 1) spans the space Gx, hence ^ aiLi is non-derogatory. By 
the theorem 2.7, Mq is similar to Mp hence ^ aiM t is similar to Lx hence 
non-derogatory. This proves (LA). The converse: (LA)=>(AT) immediately 
follows from Proposition 4.1. To prove (CA), let Mj = diag{M { -j ) ) and Mi = 
diag(Mi^) as in (4.10). Then ^aiMi = ^ aidiag(Mi^) is non-derogatory if 
and only if ^ aiMi^ is non-derogatory for each k. 

Finally, to prove the equivalence of (PDE) to the rest of the statements of 
the theorem, observe that, by (CA) it is enough to prove this for primary ideals 
J with V(J) = {0}, i.e., with J x C C[x]. Without loss of generality, assume 
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that J £ 3c with G — span{l, X\, . . . ,X 1 1 }. This means that the dual basis 
for J^- is of the form 

{x k 1 +f k ,k = 0,...,N-l} (5.1) 

where f k do not contain the monomials {1, X\, . . . , x x ~ }. In particular, it (5.1) 
contains at most one linear form: x\ + f\. 

Conversely, assume that x\ is the only linear form in J- 1 and J is not curvi- 
linear. Then J- 1 could not have a basis of the form (??) hence no polynomial 
in J- 1 can contain a monomial x™ for m > N — 1, for otherwise its consecutive 
derivatives would produce a basis of the form (3.5). Since J 1 - is TV-dimensional, 
it must contain a polynomial F that has no pure powers of x\ in it. But that 
means that an appropriate derivative of F will give a linear form Xk with k =/= 1. 
Indeed let u = a;" 1 . . . x^ k . . . x^ d be a monomial of the highest degree in F such 
that Qffc > 1 for some k > 1. Then 

x k = (I?" 1 . . . D?- 1 ...x a /)u= {D^ . . . V?- 1 ...x a /)Fe J x 

which gives the desired contradiction. ■ 

Corollary 5.4 Every radical ideal projector is curvilinear. Converse is not 
true. 

Proof. Diagonalizing Mj and taking a generic linear combination J^ aiMi we 
conclude that SQ^ aiMi)S _1 has distinct elements on the diagonal, hence is 
non-derogatory Therefore J^ aiMi is non-derogatory and J is curvilinear. The 
second part of the statement follows from the example below. ■ 

Example: The kernels of projectors P* , L and H are curvilinear ideals. In 
particular, looking at the duals we see that (kerP*)- 1 contains precisely one lin- 
ear term while none of the Hj for (kerL)^ contain linear terms since (ker L) 1 - 
is a combination of pure exponentials. The ideal (ker P^) ± is not radical, yet 
curvilinear. It complements the span{l,y,y 2 }. The (ker H) 1 - has one summand 
with the linear term and one without. Both ideals (ker H) 1 - and (keri)- 1 com- 
plement the space, say, {1, x + y, (x + y) 2 }. The kernel of the Taylor projector 
kerT is not curvilinear since (kerT)- 1 contains two linear forms. 

Remark 5.5 The equivalence of (AT) and (LA) was observed in [BS] and [Co]. 
I learned the equivalence of (AT) and (PDE) from [Co]. The proof presented 
here is quite different, and in my opinion, much simpler then the one in [Co]. 

5.3 Gorenstein Ideals 

Definition 5.6 A zero-dimensional ideal J G $q is called Gorenstein if there 
exists one function F e C[[x]] such that T>(F) — J- 1 . 

Gorenstein ideals form an important class of ideal in commutative algebra 
(cf. [B]). 
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Theorem 5.7 Let J G 3g be a zero- dimensional ideal. The following are equiv- 
alent: 

(PDE) J is Gorenstein. 

(LAI) The sequence of adjoints MJ ~ {M^, . . . , MJ) is cyclic. 

(LA2) M*j is similar to Mj 

(CA) Every ideal Jk in the primary decomposition J — O s k=1 Jk is Goren- 
stein. 

Proof. M* is cyclic if and only if there exists anf e J 1 such that {p(M* )F*,p <E 
C[x]} = G*. This is equivalent to {(p(D)F)*,p e C[x]} = G* which, in turn, is 
equivalent to V{F) = {(p(D)F),p € C[x]} = P 1 . 

To prove (LA2) observe the ideals generated by Mj and Mj by (4.2) are 
equal. Thus, by the theorem 4.2 the sequence Mj is similar to Mj. ■ 

Example 5.8 The multiplication operators of the Taylor projector T from (3.5) 
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T is not a cyclic sequence, as was noted in [BS]. The reason for this is 
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that (kerT) = span{l,x, y} is not a deflation of a single polynomial. 

other hand the projector P* , being curvilinear is Gorenstein 
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and Mp is a cyclic sequence with cyclic vector (0,0,1). Observe that the dual 
(kerP*) 1 - are given by (l,x,2y + x 2 ) and T>(2y + x 2 ) = (kerP)- 1 . 
Finally consider F — xy, then F>(F) = span {l,x,y,xy} and 



J = {f€C[x,y]:P(f)=0,P€V(F)} 



(5.2) 



is Gorenstein, yet not curvilinear, since J- 1 contains two linearly independent 
polynomials: x and y of degree 1 . 

Proposition 5.9 If J is curvilinear, then J is Gorenstein. 

Proof. If J is curvilinear then there exists a linear combination M := ^2 a jMj 
such that Mo is non-derogatory. This implies that M is non-derogatory hence 
cyclic. Thus M* = (M*) is cyclic and J is Gorenstein. ■ 

Theorem 5.7 has an interesting and unexpected corollary to linear algebra: 
Every square matrix M is similar to its adjoint. This is not the case for 
sequences of commuting matrices as is seen in the example where My is not 
cyclic. From equivalence of (LAI) and (LA2) of the theorem 5.7 we immediately 
obtain the following corollary: 



Theorem 5.10 A cyclic sequence L = (L\,. 
L* = (I/*, . . . , L*j) if and only if L* is cyclic. 



. , Ld) is similar to its adjoint 
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5.4 Hermite projectors 

Definition 5.11 A projector P G CPg is called Hermite if there exists a se- 
quence of Lagrange projectors P n such that 

Pnf -» Pf (5.3) 

for every f G C[x]. An ideal J = kerP for Hermite projector P is called a 
Hermite ideal 

Proposition 5.12 Let J = n Jk be a primary decomposition of the ideal J € 
-3g. Then J is Hermite if and only if each Jk is. 

I have a messy proof for this yet I feel that there should exist a short concise 
one that, so far, I failed to find. 

Every Lagrange projector is Hermite. In one variable every ideal projector 
P is Hermite since the polynomial h £ C[x] that generates the ideal kerP can 
be approximated by polynomials h n of the same degree with distinct zeroes. 
The ideal projectors with ideals generated by h n are Lagrange. 

6 Main course 

My personal journey through the landscape of ideal projectors began with a 
question of Carl de Boor [Bol]: Is every finite-dimensional ideal projector Her- 
mite? Ironically there where questions in other areas of mathematics that at 
the end turned out to be the same or almost the same questions. Here is the 
list: 

Problem 6.1 (AT, [Bol]) Is every finite-dimensional ideal projector Hermite? 

(AG, [FoJ) Is the border schemes B s irreducible 

(LA, [Ge], [GuJ) Can every d-tuple of commuting matrices be approximated 
by commuting diagonalizable d-tuple of matrices 

(PDE, [Le]) Can a space of solutions of the homogeneous systems of PDEs 
with constant coefficients be approximated by spaces of pure exponentials. 

With the hindsight of the previous chapters, you can guess that the answer 
to all these question is the same. It turns out to be: "Yes" in one and two 
variables and "No" in three or more variables. For (AT) and (PDE) this was 
done in [S2], for (AG) in [Fo] and [la] 

John Fogarty [Fo] proved a remarkable theorem, which translated from 
Hilbert schemes to border schemes says: 

Theorem 6.2 If q spans an N -dimensional sub space ofC[x,y] thenB B is smooth, 
connected, irreducible ajfine variety of dimension 27V. 

Before we explain the words smooth and irreducible, let us lay the ground- 
work for asking questions. Are any of the statements in the theorem 6.2 true 
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for border schemes in more than two variables? Do the answers depend on 
particular space G? 

Surprisingly the answer to the first question is "No" and to the second is 
"Yes" . Somehow, in little understood way, the structure of the border scheme 
depends strongly on the subspace G. 

6.1 Topology 

Next proposition says that various topologies on B g are all the same: 

Theorem 6.3 Let P n and P be in *Pg -The following are equivalent 
(AT) P n f - Pf for every f 
(AG) P n — > P as points in B g 
(LA) The matrices M Pn -*■ M P 
(PDE) For every F <G (ker P) 1 - there exist F n € (kcr Pn) 1 - such that F n (f) -* 

F (f) f° r ever v f e c[x]. 

Proof. The equivalence of (AT), (AG) and (LA) are immediate from Proposi- 
tion 2.2 (ii) and constructions of Mp. The equivalence of (PDE) is intuitively 
obvious. Since P n and P are totally determined by their kernels and ranges and 
the ranges are the same, the kernels have to converge. The rigorous proof takes 
a little bit of doing (cf. [S3], [S4]) ■ 

6.2 Irreducibility 

A scheme (afiine variety) Z called irreducible if Z is not a union of two proper 
subvarieties of Z. So what does the irreducibility of B g has to do with Hcrmite 
projectors? 

Before answering this question, let us point out a few facts. In what follows 
we will freely identify the projectors in ^q with points in B g . 

Proposition 6.4 The set C g := {P € B s : P is Lagrange} is a Zarisski open 
subset of B g , i.e., the complement to L g is a subvariety of B g . In particular, 
Zarisski and Euclidean closure of L g in B g coincides and the dimension of this 
closure is dN . 

Proof. To prove that the complement of C c „ of C g is Zarisski closed, we need to 
show that the points P e C g arc characterized by being solutions of polynomial 
equations. This can be easily done (cf. [S5]). The coefficients defining Lagrange 
projectors P € C g are solutions to interpolation problems, hence, by Cramer's 
rule can be expressed as ratios of determinants, that re polynomials in their 
entries (cf. [S5]). The entries depend on N interpolation points in C d (dimension 
Nd). The upshot is that C g has a rational parametrization, hence the Zarisski 
closure of C g of C g is irreducible. Moreover, since C g is Zarisski open in an 
irreducible variety C g , it Zarisski closure coincides with its Euclidean closure 
(cf. [Mu]) which, by theorem 6.3, is the subvariety of B g that consists of Hermite 
projectors. ■ 
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Let Ti s C B s be the irreducible subvariety of B s consisting of Hermite projec- 
tors. We can now answer the question posed at the beginning of this subsection: 

Theorem 6.5 ([S5]) Letg be any basis for G. ThenH s is irreducible subvariety 
of dimension dim7i B = dN and B s is irreducible if and only if every P £ ^q 
is Hermite. In this case dim,B fl = dN. 

Proof. If H. s y^ B s then, since the complement C c . of £ B is a subvariety of B B , 
it follows that B s = H B U C c „ and B s is reducible. ■ 

6.3 Hermite and non-Hermite projectors 

Here is an immediate corollary of Theorem 6.3: 

Proposition 6.6 Let P € tyc- The following are equivalent: 

(AT) P is Hermite, i.e., can be approximated by Lagrange projectors 
(PDE) (kerP)- 1 is the limit of subspaces consisting of pure exponentials 
(LA) Mp can be approximated by a commuting sequence of diagonalizable 

matrices 

Corollary 6.7 If G C C[x, j/] then every P G tyc is Hermite and the scheme 
B g is irreducible. 

Proof. By Proposition 6.6, we only need to prove that every pair of commuting 
matrices can be approximated by diagonalizable matrices, since diagonalizable 
matrices correspond to Lagrange projectors. But this is a known fact from [MT]. 
For a nice proof of it see [BS] and [Gu] . ■ 

Theorem 6.8 If d > 2 then for some G C C[xi, . . . , Xd] the scheme B s is not 
irreducible, hence there are projectors P e *Pg that are not Hermite. 

Proof. (Essentially due to Iarrobino [la], cf. also [S5] and [S6]). Let U C C[[x]] 
be the span of ruffly half of monomials of degree n and let V be the other half. 
Now let E C C[[x]] be the space spanned by C[x] <n and polynomials of the 
form 

u - y^ a v , u v (6.1) 

vev 

for some coefficients a vu . Each such E is /^-invariant, hence for each such 
E there exists P <E ^P sp an{C[x]<„uy} with (keri 3 )- 1 =£we conclude that the 
dimension of B s is at least as large as the cardinality of the set {(u,v) : u <G 
U,v e V}. Easy calculations show that, for d > 2 and sufficiently large n, this 
cardinality is greater than d x dim(span{C[x] <n U V}) which, by Theorem 6.5 
proves the corollary. ■ 

Compare this theorem to the next proposition to see how different Vpc could 
be for different G of the same dimension: 

Proposition 6.9 Every curvilinear projector is Hermite. Hence for every N 
and d there exists G d C[x] such that B 2 is irreducible. 
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Proof. Let P be curvilinear and, without loss of generality the matrix M\ in 
Mp be nonderogatory. Then there are polynomials pi,i — 2 : d such that M, = 
Pi(Mi). Approximate Mi by diagonalizable matrices M^ . Then Pi (M^' ) are 
diagonalizable, approximate Mi and commute. Thus P is Hermite. Now for any 
TV let G = span{l, . . . , x x ~ } considered as a subspace of C[xi, . . . , xj[. Then, 
by definition, every P e CPg is Hermite. ■ 

The last proposition appeared in [S3] with a much more difficult proof. 

What about Gorenstein ideals? 

Theorem 6.10 (cf. [CN]) In three variables the subvariety of Gorenstein ideals 
in B g is irreducible. In particular every projector P with Gorenstein kernel is 
Hermite. 

And this leads to a very sad personal story: 

6.4 The tail of one projector 

In [S6] the author gave many convincing reasons why the projector Pq from 
C[x, y, z] onto its ten-dimensional subspace of quadratic polynomials G — C[x, y, z]<2, 
given by 

(ker P ) ± = V(x 3 - yz, y 3 - xz, z 3 - xy) (6.2) 

is not Hermite. It would have been great if it wasn't. You see, from the argument 
in the Theorem 6.8 it follows that in three variable there exists a subspace G of 
dimension 102 such that the corresponding B s is not irreducible. It is now known 
(cf. [CEVV]) that every G C C[x, y,z] with dimG < 8 induces an irreducible 
scheme B s . What happens between 102 and 8 is a wide open question. If my 
conjecture was true, it would have settle that question. Unfortunately it is not: 

Theorem 6.11 The projector Pq is Hermite. 

Proof. Consider a projector P from C[x, y, z] onto the span {C[x, y, z]<2 Uxyz} 
given by 

(kcrP)- 1 = V{-{x A + y 4 + z 4 ) - xyz). (6.3) 

The ideal kcrP is Gorenstein and by the theorem 6.10, P is Hermite. By 
proposition 6.6 (PDE) this means that there exist coefficients (a" ,j — 1 : 11) 
and points { (x™ , y , Zj ) — ► 0, j — 1 : 11)} such that 

10 1 

a ^ e ^x+y^\+z^z + Y^ a f) e -r-+yiy+ z i z - ^(x 4 + y 4 + z 4 )-xyz. (6.4) 

Multiplying both parts by e - ^" x+J/ n v+z n z ^> and letting 

(?,?,?) := {xf\y { p\zf) - (x^,y^,z^) (6.5) 
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for j = 1, . . . , 10 we have 

10 



«u + S o5 n) e^" )x+s i" ), ' +2 i B) * - J(x 4 + y 4 + z 4 ) - xyz. (5.6) 

i=i 

Differentiating with respect to D x , D y and D z we conclude that, the Lagrange 
projectors P n at ten points: 

(ij"'?,?)^ 3 (6.7) 

have the property that for every F <G (ker Po) ± there exist F n € (kerPn)- 1 such 
that F n — ► F. Once again by proposition 6.6 (PDE), this proves the theorem. 



6.5 Smoothness 

Definition 6.12 A scheme is smooth if at every point of the scheme the di- 
mension of the tangent space coincides with the dimension of the scheme. 

To compute the dimension of the tangent space of B g one has to differentiate 
the equations in the definition of section ??, which is a rather tedious task. 
Fortunately there is a somewhat of a short cut. 

Theorem 6.13 (cf. [MS]) For every ideal J € B s the dimension of the tangent 
space to B g at J is equals to 

dim Hom c[x] (J, C[x]/ J). (6.8) 

Let me explain what does the term in (6.8) means. Homc\ x ](J,C[x]/J) is 
all mappings S from the ideal J into the quotient algebra C[x]/J such that 

S(pifi +p 2 f2) = [viS.fi] + \p2Sf2] e C[x]/J (6.9) 

for all polynomials (!) pi,p2 € C[x]. 

For instance, let T be a Taylor projector from C[x, y, z] onto span of {1, x, y, z}. 
Then J := kerT is generated by monomials of degree 2 and 

C[x]/J = span{[l], [x], [y], [z]}. 

Now imagine that S G _ff otoc[ x i ( J, C[x]/J) is such that 

S(x 2 ) = o[l] + b[x] + c[y] + d[z] 
S{xy) = 01 [1] + bx [x] + C\ [y] + d\ [z] 

Since x 2 y e J and by (6.9) it follows that 

s (y(x 2 )) = a \v\ + h \ x v\ + 4y 2 } + d \y~A = a \v\ = 

S(x(xy)) = ai[x] +bi[x ] + Ci[xy] + di[xz] = a\[x\ 
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since all quadratic terms are in J. Thus a = a\ = and the rest of the coeffi- 
cients can be chosen arbitrary. In short, homomorphisms map every generator of 
J into an arbitrary linear combination of [x] , [y] and [z] . Since there are 6 gener- 
ators (quadratic monomials), the dimension of _ffomc[ x i(J, C[x]/J) = 6x3 = 18, 
but the dimension of B s = dim Gxrf = 4x3 = 12 and £> fl is not smooth. 

Now let us look at another example: 

The projector P is from k[x,y,z,w] onto the 8-dimensional subspace G 
spanned by g :={l,x, y, z, w, xz,yz, yw} defined by Pxw — yz and Pu = 
for the rest of monomials not in G. Computing Hom C [ x ](J, C[x]/J) we have 
have 

dim #om c[x] (J, C[x]/ J) = 25 < dimG x 4 = 32 

and this is interesting. The dimension of an irreducible algebraic variety cannot 
less than the dimension of the tangent space at any point. Hence 

Theorem 6.14 ([EI]) For q :={l,x,y, z, w,xz,yz, yw}, the border scheme is 
not irreducible and the projector P defined above is not Hermite. 

This is about the only explicitly known ideal projector that is not Hermite. 

7 Deserts 

Nothing tastes sweeter to a mathematician then an open problem. Here we will 
present a few of them. 

7.1 Gorenstein Ideals 

Problem 7.1 What is the relationship between the interpolating property of an 
ideal projector and the fact that kerP is Gorenstein? 

The proof of Theorem 6.10 in [CN] is very involved. 

Problem 7.2 Find a "linear algebra" proof of the following consequence of The- 
orem 6.10: Let (A,B,C) be a cyclic triple of commuting N x N matrices such 
that (A* , B* ,C*) is also cyclic. Then there exist triples of commuting diagonal- 
izable N x N matrices (A n , B ni C n ) such that (A n , B n ,C n ) — > (A, B, C). 

7.2 Hermite projectors in three variables 

Problem 7.3 What is the least N such that there exists an N -dimensional 
subspace G C C[x, y, z] and a projector P € *$q which is not Hermite? 

Such number must be between 9 and 102. 

Problem 7.4 Find one non-Hermite projector on C[x, y,z] explicitly. 
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Problem 7.5 (Guralnick [Gu]) What is the least N such that there exists a 
triple (A, B, C) of commuting N x N matrices that are not approximated by 
diagonalizable commuting matrices? 

That number must be between 9 and 30. Observe that the cyclicity assump- 
tion on (A,B,C) is omitted, which makes it a different problem from problem 
7.3. 

Problem 7.6 Find a triple (A,B,C) of commuting N x N matrices explicitly 
that are not approximated by diagonalizable commuting matrices. 

The next problem is of philosophical nature: 

Problem 7.7 What is so special about three(!) variables that make these ques- 
tions so much more difficult then the similar questions in two or four variable ?. 
Is three variables somehow on the cusp? 

7.3 Connectedness 

Problem 7.8 Is the border scheme B g , where g is the monomials basis for G, 
connected? In other words, given two ideal projector Pq,Pi <E tyo does there 
exist a continuous family of ideal projectors P(t) € ^?g such that P(0) = Pq 
and P(l) =Pi? 

For a monomial Z?-invariant space G C k[x] this problem was posed in [R2]: 
Since every irreducible variety is connected, the answer is affirmative for 
G C C[x,y\. The answer is also affirmative for those D-invariant monomial 
spaces G where deg/ > maxjdegg :jeG} for every / € dg. It is interesting 
to note that it follows from the general theory of Groebner basis (cf. [MS, 
Remark 18.3], [R2]), that "the Hilbcrt schemes" are connected. That is, for a 
given pair of ideal projectors P , P\ € tyc there exist a continuous family P(t) 
of ideal projectors, such that P(0) = Po and P(l) = Pi and dim ran P = dim G 
for all t. The rub is: for some t the ideal projector P(t) may project onto a 
subspace of dimension N that is different from G. 

7.4 Analysis (finally) 

Let P € *Pg be a Hermite projector. That means that there curves Z(t) = 
(zi(i), . . . zjv(t)), Zfe(i) e (C ) N such that the Lagrange projector P(t) € ^g 
interpolating at Z(t) converge to P. It is intuitively obvious that the trajectory 
of the curves Zfe(i) must "resemble" the surfaces b — Pb,b <G dg. In other 
words Given a Hermite projector P e Sjg and a family of Lagrange projectors 
P(t) — > P, what is the relationship between "trajectories" of Z(t) = V(ker P(t)) 
and P? 

Problem 7.9 Define "resemble" . 
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Illustration: Recall the two primary ideal projectors T and P* onto G = 
C[x,y] <1 given by (3.5). If all the points in Z(t) tend to the origin along the 
curve (t,t 2 ) C C 2 then the corresponding Lagrange projectors P(t) converge 
to P* . Computational evidence suggests that even a slight modification of this 
trajectory leads to a family of Lagrange projectors that converges to the Taylor 
projector T. In fact a "random" choice Z(t) — ► leads to a family of Lagrange 
projectors P(t) that converge to T. It seems that the Taylor projector is much 
more central in *Pg than P*. In other words, neighborhoods of T in *Pq have 
larger density than neighborhoods of P*. This suggests that, even in this simple 
case, the "geology" of tyo is not well-understood. 

Acknowledgement: I would like to thank Carl de Boor for teaching me all 
about the subject and patiently coaching on its finer points. I also want to thank 
Martin Kreuzer and Lorenzo Robbiano for exposing me to the algebraic theory 
of the border schemes. I want to thank my student, Tom McKinley for pleasant 
discussions we have while smoking outside the department. My appreciations 
go to many anonymous referees from algebraic geometry whose friendly, and not 
so friendly, criticism taught me a lot. Finally I want to thank my friend George 
for his hospitality during this and many previous conferences. 

References 

[B] H. Bass, On the ubiquity of Gorenstein rings, Math. Zeitschr. 82, 

(1963), 8-28 

[Bij Birkhoff, C, The algebra of multivariate interpolation, in Constructive 

approaches to mathematical models, C. V. Coffman and G. J. Fix (eds), 
Academic Press, New York, 1979, 345-363. 

[Bol] Boor, C. de, Ideal interpolation, in Approximation Theory XI, Gatlin- 
burg 2004, Chui, C. K., M. Neamtu and L. Schumaker (eds.), Nashboro 
Press (2005), 59-91. 

[Bo2] Boor, C. de, What are the limits of Lagrange projectors?, in Construc- 
tive Theory of Functions, Varna 2005, B. Bojanov, ed., Marin Drinov 
Academic Publishing House, Sofia, (2006), 51-63. 

[Bo3] Boor, C. de, Interpolation from spaces spanned by monomials, Adv. 
Comput. Math. 26 (1-3) (2007) 63-70. 

[BR] Boor, C. de, and A. Ron, On polynomial ideals of finite codimcnsion 
with applications to box spline theory, J. Math. Anal. Appl. 158 (1991), 
168-193. 

[BS] Boor, C. de, and B. Shckhtman, On the pointwise limits of bivariate 

Lagrange projectors, LAA, 429 (2008), 311-325. 

[CEVV] Cartwright, Dustin A., Daniel Erman, Mauricio Velasco, Bianca Viray, 
Hilbert schemes of 8 points in A d , arXiv:0803.0341. 



SHEKHTMAN: IDEAL PROJECTORS 147 



[CN] Casnati, G. and R., Notari, On the Gorenstein locus of some Punctual 
Hilbert Schemes, arXiv:0803.1135. 

[Co] Cox, D., Solving equations via algebras, in Solving Polynomial Equa- 

tions, Foundations, Algorithms, and Applications, (A., Dickenstein and 
I., Z., Emiris cds.) Algorithms and Computation in Mathematics, 14, 
Springer 2005, 63-123. 

[CLOl] Cox, D., J. Little and D. O'Shea, Ideals, Varieties, and Algorithms, 
(second edition), Springer- Verlag, New- York-Berlin-Heidelberg, 1997. 

[CL02] Cox, D., J. Little and D. O'Shea, Using Algebraic Geometry, Graduate 
Texts in Mathematics, Springer- Verlag, New- York-Berlin-Heidelberg, 
1997. 

[EI] Iarrobino, A., and Emsalcm, J., Some zero-dimensional generic sin- 

gularities; Finite Algebras Having Small Tangent Space, Compositio 
Mathematica, 36(2), 1978, 145-188 

[Fo] Fogarty, J., (1968): Algebraic families on an algebraic surface. Amer. 

J. Math., 90:511-521. 

[Ge] M. Gerstcnhaber, On dominance and varieties of commuting matrices, 

Annals of Mathematics 73 (1961), 324-348. 

[Gu] Guralnick, R., A note on commuting pairs of matrices, Linear and 

Multilinear Algebra 31 (1992) 71-75. 

[GS] Guralnick, R., and B. Scthurman, Commuting pairs and triplets of 

matrices and related varieties, Linear Algebra Appl. 310 (2000) 139- 
148. 

[la] Iarrobino, A., Reducibility of the families of 0-dimensional schemes on 

a variety, Inventiones Math. 15, (1972), 72-77. 

[KR] M. Kreuzer and L. Robbiano, Computational Commutative Algebra 2, 
Springer, Heidelberg (2005). 

[Le] Lefranc, M., Analyse Spectrale sur Zn, C. R. Acad. Sci. Paris 246, 

(1958), 1951-1953 

[Ma] Macaulay, F. S., The algebraic theory of modular systems, Cambridge 
University Press, 1916 (reprinted 1994). 

[MSh] McKinley, T., and B. Shckhtman, On simultaneous block- 
diagonalization of cyclic sequences of commuting matrices, Linear and 
Multilinear Algebra, DOI: 10.1080/03081080802443125 

[MS] Miller E,. and Sturmfcls B., Combinatorial Commutative Algebra, 
Graduate Texts in Mathematics, 227, Springer 2000. 



148 SHEKHTMAN: IDEAL PROJECTORS 



[M] Moller, H. M., Hcrmitc interpolation in several variables using ideal- 

theoretic methods, In Constructive Theory of Functions of Several 
Variables, W. Schempp and K. Zeller (eds.), Lecture Notes in Mathe- 
matics, Springer, (1977), 155-163. 

[Mo] B. Mourrain, A new criterion for normal form algorithms, in: Applied 
Algebra, in Algebraic Algorithms and Error- Correcting Codes (13th 
Intern. Symp., AAECC-13, Honolulu, Hawaii USA, Nov.'99, Proc), 
Mark Fossorier, Hideki Imai, Shu Lin, Alan Pol(eds.), Springer Lecture 
Notes in Computer Science, 1719, Springer- Verlag, Heidelberg, 1999, 
430-443. 

[MT] Motzkin T., S., and, O. Taussky, Pairs of matrices with property L. II, 
Trans. Amer. Math. Soc. 80 (2) (1955) 387-401. 

[Mu] Mumford, D., The Red Book of Varieties and Schemes, Lecture Notes 
in Mathematics 1358, Springer- Verlag 1988. 

[N] H. Nakajima, Lectures on Hilbert schemes of points on surfaces, Amer. 

Math. Soc. University Lecture Series, vol.18, Providence RI, 1999. 

[Rl] L. Robbiano, Zero-dimensional ideals or the inestimable value of es- 

timable terms, in Constructive Algebra and Systems Theory, B. Han- 
zon, M. Hazewinkel (Eds.), Royal Netherlands Academy of Arts and 
Sciences, Netherlands, 2006, pp. 95-114. 

[R2] L. Robbiano, On border basis and Groebner basis schemes, 

arXiv:mathn0802.2793v2. To appear in Collectanea Mathematica, Vol 
60(1). 

[SI] Shekhtman, B., On one Question of Ed Saff, Elec. Trans. Numcr. Anal., 

Vol 25, (2006), 439—445. 

[S2] Shekhtman, B., On a conjecture of Carl de Boor regarding the limits 

of Lagrange Interpolants. Constr. Approx. 24(3), (2006), 365-370. 

[S3] Shekhtman, B., Bivariate ideal projectors and their perturbations, 

Adv. Comput. Math., Volume 29, Number 3, (2008), 207-228. 

[S4] Shekhtman, B., On perturbations of ideal complements, in: B. Ran- 

drianantonina, N. Randrianantonina (Eds.), Banach Spaces and their 
Applications in Analysis, De Gruyter, Berlin-New York, 2007, pp. 413 
422. 

[S5] Shekhtman, B., On the limits of Lagrange projectors, Constructive 

Approximation, doi: 10. 1007/s00365-008-9016-0 

[S6] Shekhtman, B., Ideal Interpolation, Translations to and from Algebraic 

Geometry, Approximate Commutative Algebra, to appear in "Texts 
and Monographs in Symbolic Computation". 



SHEKHTMAN: IDEAL PROJECTORS 149 



[St] Stetter, H.J., Numerical Polynomial Algebra, SIAM, Philadelphia 

2004. 

[Stu] Sturmfels, B., "Four counterexamples in combinatorial algebraic geom- 
etry", Journal of Algebra 230 (2000), p282-294. 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS , VOL.8, NO.1, 150-164,2010, COPYRIGHT 2010 EUDOXUS PRESS, LLC 



SEMIPARAMETRIC ESTIMATORS FOR HEAVY TAILED 

DISTRIBUTIONS 

Valeria Caviezel 

Department MSIA 

University of Bergamo, 

Via dei Caniana, 2 - 24127-Bergamo- Italy 

E-mail: valeria.caviezel@unibg.it 

Sergio Ortobelli 

Department MSIA 

University of Bergamo, 

Via dei Caniana, 2 - 24127-Bergamo- Italy 

E-mail: sergio.ortobelli@unibg.it 

Prof. Svetlozar (Zari) T.Rachev 

Chair of Econometrics, Statistics 

and Mathematical Finance 

School of Economics and Business Engineering 

University of Karlsruhe and KIT, 

Kollegium am Schloss, Bau II, 20.12, R210 

Postfach 6980, D-76128, Karlsruhe, Germany and 

Department of Statistics and Applied Probability 

University of California, Santa Barbara 

CA 93106-3110, USA and 

Chief Scientist of FinAnalytica INC. 

E-mail: rachev@statistik.uni-karlsruhe.de 



Abstract: In this paper we describe and apply estimating function theory to 
evaluate the parameters of parametric distributions uniquely defined by their 
characteristic functions. We first implement an estimating function model based 
on the first four moments of a parametric function of the underlying random 
variables. For instance we propose two parametric functions of the underlying 
random variables so as to obtain its first moments more easily by the simple 
knowledge of the characteristic function. Thus we consider the estimates that 
present the minimal asymptotic variance with respect to the parameter of the 
function. Then we propose an empirical analysis based on simulated stable 
Paretian distributions. Using simulated data of stable distributions we evalu- 
ate the forecasting power of the proposed methodology comparing it with the 
analogous maximum likelihood estimates. Moreover we show how to apply the 
same methodology to some well-known infinitely divisible distributions. 
Key words: Estimating function, Stable distributions, infinitely divisible dis- 
tributions, asymptotic variance. 
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1 Introduction 

Several empirical investigations have shown that empirical distributions of many 
observed economic and financial data deviate from the ideal Gaussian law, since 
they often exhibit skewness and fat tails. Thus many alternative distributions 
have been proposed to approximate the underlying data. The classical example 
is the case of infinitely divisible distributions uniquely defined by their character- 
istic functions. Infinitely divisible distributions take into account the skewness 
and kurtosis of return series. In addition their associated Levy processes can be 
used for valuing intertemporal financial products such as the derivatives. Thus 
several Levy processes have been widely used in recent financial literature (see, 
among others, Rachev and Mittnik (2000) and the references therein). In par- 
ticular, the stable Paretian distributions probably present the most attractive 
modeling properties, since they not only provide a better empirical fit, but they 
also posses heavy tails and result as limit distribution of sums of i.i.d. random 
variables 

In this paper, we deal with the problem of valuing the parameters of dis- 
tributions uniquely defined by their characteristic function. Typically, if we 
know the characteristic function, we can determine the approximated distrib- 
ution by inverting the characteristic function with the Fast Fourier Transform 
(FFT) and then we can value the maximum likelihood estimates (MLE) of the 
parameters. However, even if this methodology is widely used it does not per- 
mit an a priori valuation of the efficiency of the estimates. In this paper we 
propose a semiparametric methodology that gives optimal estimates of distribu- 
tion parameters based on a valuation a priori of the asymptotic variance of the 
estimates. The semiparametric valuation is based on estimating function (EF) 
theory (see Godambe (1991) and the references therein). We suggest using this 
estimation either for the moments curve or for a bounded parametric function of 
the underlying distribution that admits finite the first four moments. Thus we 
propose to minimize the asymptotic variance of the estimates subject to some 
estimating equations. Finally we value the forecasting power of some of these 
semiparametric estimators applied to stable paretian distributions. In partic- 
ular, we compare the absolute difference between simulated data, and either 
EF estimates or MLE ones obtained for stable paretian parameters. We also 
show how to use the same methodology for some particular infinitely divisible 
distributions. 

The paper is organized as follows: Section 2 introduces semiparametric es- 
timators based on estimating function theory, and in Section 3 we compare 
semiparametric and maximum likelihood methods. Finally, we briefly summa- 
rize the results. 



2 Estimating Function parameter estimation 

Suppose we have a sample X = (X\,X2, ...,Xt) of i.i.d. observations whose 
distribution family 9(0) is parametrized by 9 = (9i,02, ■•■, P ) GfiC R p . In 
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the theory of estimating functions the optimum has two components: "unbi- 
asedness" of EF and "smallness of the variance" of the standardized EFs. An 
estimating function hi (X, 9) is called unbiased if E(hi (X, 9)) — for all admis- 
sible 9. In particular, when we consider a single parameter 9 the score function 
— ^M — — represents a the typical example of unbiased estimating function. We 
say that the unbiased estimating functions hi(X s ,9) are mutually orthogonal, 
when E(hi(X s ,9)hj(X s ,9)) = for every i ^ j',i,j = 1, ...,n. Among all the 

T n 

linear combinations lo.k = 5Z 12 a k,i(9)hi(X s ,9), (k — l,...,p) of unbiased, 

s=li=l 

mutually orthogonal estimating functions hi(X s ,9), the estimating functions 

T n / dh\ 

l*e,k = EE<»m(^(^.«) with coefficients a^O) = E l-^-j /E (h 2 ) , 
Vi = l,...,n; Vfe = l,...,p, are optimal (with minimum variance). According to 
estimating function theory the optimal EFs 9(t) — [9(t),i, ■ ■■, @(T),p] obtained as 
consistent solution of the system: lg k = 0, k = 1, ...,p; after orthogonalization, 
standardization and optimal combination is asymptotically Gaussian, i.e., 

T ($ {T) - *) -> MVN(0, V E ^{9)) for T -> oo, (1) 

(dl* ei \ 

where V E f{9) = N,j]i,j=i,...,p and v i, 3 = E ( ^r I ; *> J = 1> —,P- These results 

can be extended to stationary series that are not necessarily independent (see 
Li and Turtle (2000)). Moreover, we get similar results using GMM (General 
Method of Moments), but in this case we need a recursive methodology to 
determine the parameters in the linear combination l* e ■ . Thus any algorithm 
based on GMM needs additional computational time. 

Typical examples of optimal estimating functions are those proposed by 
Godambe and Thompson (1989) where they use two unbiased and mutually 
orthogonal estimating functions: h\{X s ,6) = f(X s ) — m/(0) and hi{X s ,9) = 
(f(X s ) - m f {9)) 2 - aj(0) - 8f($)<T f (0) (f(X s ) - m f {9)) , where / is a measur- 
able real function, m f (9) = E(f(X„)), <r}{9) = E ((f{X s )) 2 \ - m}{9), and 



EUf(X s )-m f (0)Y 
s /(^) = — 377J\ ■ Therefore, we get the EF optimal estima- 

tor 9(r\ of the vector of parameters 9 solving the equations (for k — l,...,p) 

T 

l *ek=T, (ak,ihi(X 8 ,$) + a k< 2h2(X s ,e)) = 0, where 

s=l 

E ( dh 1 (x s ,e) \ dm f (e) 

_ __V de k J _ dek 

ak - 1 ~ E{h\(X s ,9)) a}(9) ' [Z) 



\-^~) - d -^? + d -^s f (0)a f (e) 



E ( dh 2 (x 3 ,ey 

E{h 2 (X s ,9)) oj(0) (k f (0) - 1 - a}(0J) 
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E((f(X s )-m } (9)f) 

and kf{6) = — j— — — . We call this class of estimating func- 

a f (0) 

tions GT (Godambe and Thompson) estimating functions. Note that we can 

also easily obtain an analytical formulation of the asymptotic variance of GT 

estimating functions when we know "It and — J* — . As a matter of fact, 

from (f) we deduce that the asymptotic variance V^ F {0) is the inverse of 

I ' dl* ■ \ 
V EF {6) = [Vi,j(6)] iij=li ... iP , where v itj (0) = E f — |^ J . Thus, we get 

dm; (6>) dm f (6) 



-T- 



jg + gg^gMg) (-gg + g^gMg) (4) 



It has been demonstrated (see Ortobelli and Topaloglou (2008)) that the solu- 
tions of GT estimating functions l* e k = are given by the values 8 k , solutions 
of the simplified equations for k — 1, ...,p : 



m f (0) = < 



where 




i £/(*.) + <*-&* ifc fe >0 

1 s=l 

i£ /(*») + <* + &* ifc fc <0 

J s=l 



T 



(5) 



6*= |4-^VI/(X s )-iv/(X s )) + ai(0)\ , and 



s=l 



Cfc = 



Qfc.l - CLk,2Sf(0)o-f(0) 
2flfc,2 




In particular, when m/(#) is itself a possible parameter, then a consistent es- 

+• +■ t +■ r to\ ■ ■ v. fr.\ v. a m ,l-a m ,2Sf(0)(Tf(6) 

timatmg function of mf(0) is given by (5) where c k = — - 



2a 



rn.2 



-i „ __ -^ + sAe)a f (e) .^, w 



"m,l — 2rn\ ' a m,2 — 7 \" dllU flm , _ Sm,(I) 1(JI d 

given k. According to formulas (2) (3), the above estimating equations (5) are 

uniquely determined by T7lf(0), o- 2 A9), Sf(0), kf(0), "g and— gg — that are 
respectively the mean, the variance, the skewness and the kurtosis of f(X), and 
the partial derivatives of m,f(0), and o- 2 A9). Thus, if X admits finite the first 
four moments, we can use the function f(X) = X to get some estimators of 
the distributional parameters. Otherwise, we can use a bounded function /. 
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Moreover, in several cases it could be useful to adopt a parametric differentiable 
function / : A — > R, where A = R x [a, b] C M 2 . As a matter of fact, in this 
case the previous estimators and functional (i.e., rrif(6, q), o-f(9, q), Sf(9, q) and 
kf(9,q)) also depend on a parameter q G [a 7 b] of the function f(X,q). This 
aspect can be used to get an optimal estimator with the minimum asymptotic 
variance as suggested in the following remark. 

Remark Suppose we have a function f : A — ► R (with ACM 2 ) such that for 
any real random variable X G 9(0) (with 9 G B C M p ) f(X,q) admits finite the 
first four moments for any admissible q. Then we get an optimal GT estimator 
with minimum asymptotic variance solving the following optimization problem: 

max \Vef(9, q)\ subject to 
(5), fc = l,...,p, 0€5; 

where \Vef(0, q)\ is the determinant of the inverse asymptotic variance (1) 

fdl* ei \ 
Vef(Q,i) — [ v i j(9,q)]i j=i P and Vij(9,q) = E ' . In order to reduce 

V Wj / 
the computational complexity of (6) we can use X)?=i v i,i(@' l) as objective func- 
tion instead of \Vef{9,q)\ ■ 

Clearly the idea of minimizing the asymptotic variance subject to the con- 
straints of some equations can be applied to many other estimators (for example 
the maximum likelihood estimator). Using a GT estimator we do not necessarily 
need to know a closed form of the density (or the cumulative) distribution of the 
underlying random variables. As a matter of fact, GT estimators can be used 
even for those distributions uniquely defined by their characteristic function 
4>x{t) = E(exp(itX)). Let us consider two possible examples. 
Moment estimator: Suppose all parametric random variables belong to the 
space U = {X/E (\X\ r ) < oo} . By using the derivatives of the characteristic 
function (/> x '(0) we can determine all the existing integer moments of X, since 
<j> x (0) = i k E(X k ). Then, as parametric functions we can use: 

• f(X,q) = \X\ q for any q G [0,r/4] (in this case m f (8,q) = E(\X\ q ) 
represents the moments curve that characterizes the distribution of \X\); 

• f(X,q) = X q for any integer q G [0, r/4]. 

Trigonometric estimator: Suppose f(X,t) = sin(tX) for some given t ^ 0. 
Then if we know the characteristic function 4>x (t) = E (exp (itX)) we can easily 
determine the first four moments of f (X,t) , since E(cos(tX)) — ~Re(4>x(t)); 
E(sm(tX)) = 1m(<f>x(t)). So, if f(X,t) = sin(tX) for a t # the first four 
moments of sin(tA) are given by: 

E(sin(tX)) = Im (<px(t)) ; E (sm 2 (tX)) = 0.5 (1 - Re (4>x@t))) 

£(sin 3 (^)) =i(3Im(0 x (t))-Im(</, x (3i))); (7) 

E (sin 4 (tX)) = i (3 - 4 Re (4>x(2t)) + Re (cf> x (U))) ; 
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and together with these we have to know m I} ' ' = — Z, x and —J* — = 

o dtik atik o9k 

-0-5dRe(^ x (2t)) _ 2i m (^ x ( f )) £Mi. This method can be easily applied 
to estimate the parameters of all distributions defined by their characteristic 
function, when from the characteristic function we can easily distinguish the 
imaginary part from the real one. For example, all the infinitely divisible ran- 
dom variables X have characteristic function (f>x(u) uniquely determined by 
the triplet [7, o 2 ,v\ that identifies the so called Levy-Khintchine characteristic 
exponent ipx (u) = log <f>x (w) given by: 

ipx(u) = i-fu - ^ 2 u 2 + f_™(exp(iux) - 1 - iuxl { \ x \ <1 })i'{dx) = 

= i hu + J^(sm(ux) - uxl {lxl<1} )v(dx)J - 

- 2 a2u2 + /- ~ ( c os(ux) - 1) v{dx) 

where 7 € R, a 2 > and v is a measure on M\{0} with J_ (1 A x 2 )v(dx) < 00. 
Therefore for any infinitely divisible random variables X we can easily identify 
the real and the imaginary one of the characteristic function given by: 

Re {(j)x{u)) = exp [ — -u 2 u 2 + / (cosfwa:) — 1) v{dx) I x 



( f + °° ■ \ 

x cos I ju + / (sin(ux) — ua;l{| x |<i} )v(dx) , 

\m{4>x{u)) = exp I ——o~ 2 u 2 + / (cos(ux) — 1) v(dx) I x 

/ r+00 \ 

x sin I 7U + / (sin(ux) — uxl^ x \ < iy)v(dx) 
\ J — 00 / 



In particular, the Levy triplet [7, er 2 , v] identifies the three main components of 
any Levy process: the deterministic component (7), the Brownian component 
(er 2 ) and the pure jump component {v). For further details on theoretical 
aspects we refer to Sato (1999). 

2.1 Examples of infinitely divisible distributions 

In this subsection we consider some particular infinitely divisible distributions: 
stable Paretian distributions, tempered stable (TS) distributions, normal in- 
verse Gaussian (NIG) distributions, and Carr, Geman, Madan, Yor (CGMY) 
distributions. For each of these distributions we describe: 

• the characteristic function distinguishing the real and imaginary parts; 

• the moments curve in the stable Paretian case and the mean, the variance, 
the skewness and the kurtosis for the other infinitely divisible distributions; 
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^ *"/(») -^^W 

ment estimator; 



• in Table I we give the derivatives „g and— -A — for the GT (EF) mo 



• in Tables II and III we give the derivatives — gg and — gg f° r 

the GT (EF) trigonometric estimator. 

Doing so, we are able to derive: 

a) a GT trigonometric estimator when f(X, t) = sin(iX) for all these distribu- 

tions; 

b) a GT moment estimator when f(X) = X for TS, NIG and CGMY distrib- 

utions; 

c) a GT parametric moment estimator when f(X,p) = \X — /i\ p for stable 

Paretian distributions. 

Stable distributions (see, among others, Rachev and Mittnik (2000) and the 

references therein) A univariate stable distribution X = S a (a, /?, /i) is charac- 
terized by four parameters. These are: the index of stability a <G (0,2], the 
scale parameter a <G M. + , the skewness parameter /3 <G [— 1,+1] and the shift 
parameter /i £~R. The stable distribution is Gaussian when a = 2, and in this 
case, a is proportional to the standard deviation, f3 can be taken to be zero 
and /i is the mean. An a stable non Gaussian distribution admits finite p-th 
fractional moment -E(|X| P ) < oo for any p £ (— l,a). A stable distribution 
can be defined in different equivalent ways. We use the characteristic function 
extensively because few stable density functions are known in closed form. The 
characteristic function of X = S a (a, f3, fi) is given by: 



4>x(t) 



exp ( — \ta\ a ( 1 — zsgn(t)/3tan — - ) + itji j if ae(0,2];«^l 
exp I — \ta\ I 1 + if3— sgn(t) log \t\ I + it/i I if a = 1. 



Thus, Re (</>x{t)) — exp (— \ta\ a ) cos(b); Im (<fix(t)) — exp (— \ta\ a ) sin(t>), where 

7TCX 

|io-| Q sgn(i)/3tan— +t/j, if a G (0,2]; a#l 

2 

t/u, — \ta\ 0— sgn(t) log It! if a = 1, 

7T 

and we can apply formula (5) to approximate a, a, f3, \i with the GT trigono- 
metric estimator when we assume f{X,t) — sm(tX). Thus using formulas (2) 
(3) (7) and (4) we can solve the problem (6): 

max |Vef(o:, a, /3,/j,, t)\ 
subject to (5); t ^ 0, 
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Table 1 



This table summarizes the derivatives used in the moment estimators for TS, NIG, CGMY and Stable 
distributions. 



X = TS(k,a,b) 



dVar(X) , ,,„ k + 2(\nb-k 2 -klnb 

— — = Aab a ~-' lk ; 

dk k 

^m^k(l-k)b-" 
da y ' 

8Var ^K 4a(l-k)(k-2)b^ 



6k { k 

2kb ' 

da 



X =NlG(a,p,S) 



i'E(X) 

as 

dE(X) 

Bf) 

dE(X) 
da 



= a 2 S[a 2 -p 1 ) ' 
= - a /}S(a 1 -p 1 )"' 



dVar(X) 

do 
dVar(X) 



= la 2 fiS(a 2 -p 2 y 



dVar(X) 
da 



-aS la-+2/3 2 a -0 



X =CGMY(C,G,M,Y) 



<~B X i 

dC 
dE(X) 

dG 
dE(X) 

dM 
c)E{X) 



= (m"-' -G'-')r(i-y) 

= CG v ' 2 r(2-Y) 
= -CM'- 2 r(2-Y) 
Cr(l-Y)UM y -'lnM-G y -'lnG)- i t'(l-Y)(M y -'-G y -')\ 



dVar(X) 

dC 
dVar(X) 

SG 
dVar(X) 

dM 
dVar(X ) 



= {M'- 2 +G , - 2 )r(2-Y) 
= -CG'-'r(3-Y) 
= -CM'- 2 r(3-Y) 

Cr[l-Y)UM V -lnM + C y ? lnC)-!/'(2-y)(M f 2 +G* 



X = S a (fi,a, M ) 



se(\x\') 



-A(p)po- F 11+/? r ) cos}' 



8e(|X|") psTro.5^r(l+^V)(2cosT + (M. + v.)cosa- | )-(l+/? 2 )sinj + 0.5(H.-i.)(l- ; « 2 )sma- 1 "] 

8/1 pa a(l + /3V)(l + g + /l 2 -g/3 2 ) 



da a^x \ a )■ 



" 2(l + AV) 



siny + — y\ cosy 4 



xpfi^l+r 2 ) ln(l + /?V 2 )'| 



where: q = cosher r=tan A(p) = 



v =7ra 2 + /?arctan(/?r) w = ko 1 - /?arctan(/?r) V = — arctan(/?r) 



2 , r f«^£j r f£ + l 



2-P 
P 
_ p 



1+0V 


a 




f 




p/(2a 




q + f} 2 - 


-qfi'+l 






2 co 
V 


2 Ka 

2 J 
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Table II 



This table summarizes the derivatives used in the trigonometric estimators for TS, and CGMY 
distributions. 



CGMY(C,G,M,Y) 


wAere 


^-^ = nr(-Y)Ircos(p + \i-w)-ct>s(v-w)(M r +G r )+scos(q-v-nv)~\ 

cRc nCYr{-Y)r . , . , ,/,„., „„i , ,1 
= 22 trsm(p + v~w)-cos(v-iv)(t 2 G" +G ) + rGcos(p + v-w) 


p - Farc/g — 

a - Farc/g — 
M 


8 Re nCYT(-Y)r . . , , >, , ,., „> , .-i 
= = — \tssm [q-v + w -cos v-w) f M +M j + sM cos [q-v + w)\ 




r = J(G> + r)> 


-5^- = nCr(-K) jcos(v - h>) [V (-F) (g 1 ' +M '' )-M'lnM-G' In G]-rcos(p+v- w) 
•F(-Y)- — lnr -jcos(o-v + w) V (-Y)--lns \-H-cos(p + v- iv)- — cos(o -v+ w) 1 

L k J L i- J r r J 


>'<i( M2+ ' 2 )' 


v = rCr(-Y)sinp 


= nr(-F)[rsin(p+v->l')-sill(v-K')(M 1 '+G')-jsill(5-v+vi')] 

Sim nCl'/'t-F) r / , / \t 7 v i v t\ / \1 
— =— r ^[- ! rco4p +V - W )-sm{ V -w)(rG y - , + G Y - [ ) + rGsm{p +V - W )\ 


w =sCr (-F)sina 

n- e L 


dim nCYr(-Y)[ , , . , >/,„ n ,,y+i\ ,» ■ / \1 

= = — = — tscQs{q-v+w)-sm[v-w)[rM +M \-sMsm[q-v + w) 

6M M 2 +t 2 L vy ' y n ' V 7 J 




— — = nCr(-Y)Lm(y - w)\ l F(-Y)(G Y + M Y ')- M Y In M -G Y lnG~\- r sin(p + v - w) 
V(-F)- — lnr + ssin(o-v+ w) f (-10- — In J + -^-cos(p + v-w)-— COS(o-v + w)\ 
TS (k,a,b) 


w/rere 


3 Re 

= /wcos (ay sin z)- yivcosp 

da 


s - k arctan(2/fo _1/ * ) 


= cos {ay sin z 6 ' + 4rfo ( ' )-b vcos/;-2//r ysmp 




p = z — ay sin z 


a* ^ 2/( +4^ 2 L w " L ''^ ■' " '"■' > ' J '~'" " n 


dim , . , , 

fowsinl ay sin z + wsin p 

8a v ' 

dlm aw r ■ i \d.i**w* A^u9+m\ 1.3/* ■ , A w 1 


w - exp{a(fo- >>cos z)\ 


Sim ay W r , n> - + 4 ,Mny b llk Info) + co S/ ;(2/fo l/A Info + zb 1/k +4zt 2 )] 
dk kb Jk +4kt 2 L y ' V 'J 
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where — £)* an< ^ — oe are & ven m Table III. Moreover, we can propose 

a GT moment estimator based on the moments curve. As a matter of fact the 
absolute central moments for any p £ (— l,a), a ^ 1 are given by: 

E (\X - fi\ p ) = a p (l + /3 2 tan 2 ^°"'" '" cos (I arcta n f/3tan ^)) A(p), 



r ' 



+00 



where A(p) = — j- — > — and T (c) = J z c e z dz for c > 



2 

while we refer to Abramowitz and Stegun's definition (see Abramowitz and 
Stegun 1970) of the Gamma function V (.) of a negative real (non integer) num- 
ber. Thus, if we use f(X) = \X — yu| p such that p is small enough (i.e., p G 

(—0.25, a/4)) we can easily get the first four moments E(f(Xy) = E I \X — n\ 3P ) 
j = 1, ...,4, in order to obtain a GT moment estimator for the stable parame- 
ters. Thus we can apply formulas (2) (3) (using — ga ? — da and 
— gg~ given in Table I) to determine estimating functions (5) for a, a, j3. 
Since we need four equations for four parameters and we do not know a closed 
form of E (\X — /i\ p sgn(A — fi)) , we suggest using, as fourth estimating equa- 
tion, the consistent estimating function of irif(6) (5) assuming that rrif(9) is 

itself a possible parameter. Since c5(a, a, 0) — E ( \X — fi\ p J — mj (a, er, /3) 2 , 
then — f „ — — »„,/!"„ m ; and m 4 = 1, and the fourth estimating 

T 

equation is /^ = Y. ( a i h i( X s,9) + a 2 h 2 (X sl 9)) = where 01 = ^ ( ~ 1 J>|3 ) , 

_^M +S/ (0) CT/ (0) 

a2 = 7-^ r-. Analogously we get the elements of the as- 

aj(9)(k f (9)-l-s}(9)) 

ymptotic variance (4). Doing so, we can maximize the inverse of the asymptotic 
variance determinant max \Vef{ol, a, /3, (J-,p)\ associated with the GT mo- 

ment estimator of stable paretian distributions. 

Tempered Stable (TS) distribution (see Tweedie (1984) and, for more gen- 
eral definitions, see Kim et al. (2008)) Tempered stable distributions depend 
on three parameters a>0;6>0;0<fc<l and their characteristic function is 
given by: 

(J>ts {u;k,a,b) = exp I ab — a ( b ' — 2iu ] 



Thus, He(<fix(t)) = ccos(d); Im(4>x{t)) = csin(af), where 



exp fab - a (b 2 ' k + 4£ 2 ) 



k/2 cos (k arctan =£ 



-a (b 2 / k + At 2 ) k/2 sin ( k arctan — ^ 
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Moreover, if we assume f(X) = X, the mean, the variance, the skewness and 
the kurtosis of tempered stable distributions are given by: 

m f (a, 6, k) = 2akb^ k - 1 ^ k ; aj(a, 6, k) = Aak{\ - k)b^ k - 2 ^ k : 

, x. ^ ( fc - 2 ) , < l ^ o 4fc - 6 - fc(l - fe) 

s f (a,b,k) = i J — r7 ^; k f (a,b,k) = 3 + \, — '-. 

[abk(l - k)} 1/2 abk(l-k) 

Therefore, considering the derivatives T »{ , —dz — — m it and — c it '' 

given in Tables I and III, we can obtain GT moment and trigonometric estima- 
tors. 

Normal Inverse Gaussian (NIG) distribution (see Rachev and Mittnik 
(2000) and the references therein) A NIG distribution depends on three para- 
meters a > 0; 5 > 0; £ (—a, a) and its characteristic function is given by: 



0at/g(u; a, 0, S) = exp <^ -5 ( \J a 2 - (0 + iu) - \J a 2 - 0- 
Thus, Re((f>x{u)) = ccos(d); Im(<j>x{u)) — csin(d), where 

(s^a 2 - 2 - 6 ((a 2 - 2 + u 2 f + 4u 2 /3 2 ) "'^ cos (e) 



c = exp 



0.25 



d = -5 ((a 2 - 2 + u 2 ) 2 + 4u 2 2 ) ' sin (e) , 

e = 0.5arctan ^ , . 

a 2 - 2 + u 2 

Moreover, if we assume f(X) = X, the mean, the variance, the skewness and 
the kurtosis of Normal Inverse Gaussian distributions are given by: 

m f (a,0, 8) = 60 (a 2 - 2 )' 1 ' 2 ; a 2 = a 2 6 (a 2 - /3 2 )~ 3/2 ; 

f R x\ 3/3 / a 2 +40 2 \ 

s f (a,0,S) = — ——7===; k f (a,0,8) = 3 1 + 



av 7 ? • v/a 2 - 2 ' ' ' V Sa 2 ^/a 2 -0 2 J ' 

The derivatives ^M, ^^ aim ^* W) and 0Rc ^ (t)) are given in Tables I 
and III. 

The Carr, Geman, Madan, Yor (CGMY) distribution (see Carr et al. 
(2002)) A CGMY distribution depends on four parameters C, G, M > 0; Y < 2 
and its characteristic function is given by: 

0cgmy(u; C, G, M, Y) = exp {CT(-Y) [(M - iu) Y - M Y + (G + iu) Y - G Y ] } 
Thus, He(<px(t)) = ccos(d); Im(4>x(t)) = csin(af), where 

- M Y - G Y 



c = exp <^ CT(-Y) 



+ (M 2 + t 2 ) Y/2 cos ( Yarctan -| J + 



(G 2 +t 2 ) Y/2 cos( Yarctan-^ 
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Table III 



This table summarizes the derivatives used in the trigonometric estimators for NIG, and Stable 
distributions. 





S a (fl,<r,M) 






where 


S Re 
da 


= p /?(# + 2<5 , ln|/o'| + ^-g 2 Jsgnf sinm + 21n Ifcr cosm 






p = \tafe~^ 


3Re 


= -pq&gntsmm 






71a 
q = tan 

2 


a Re 

a Re 
da 


= -fe ' ,fr ' sinm 

= -ay?r sgn (to) [cos m + /?<? sgn t sin m] 






m = (// + pq \t(X\ sgn(/) 


aim 

da 

aim 
Up" 


= — p\ pin + 2q\n\to\ + 7tq 2 \sgritco&m-2\n\ta\$m.m 
- pqsgntcosm 








aim 

1/T 


= te ''"' cosm 








aim 


= -apt sgn (ter) [sinm -/?gsgnfcosm] 

NIG(a,0,S) 






where 


BRe 
da 


= — n(ft 2 -a 2 -f 2 }cos# + m 3 cos (Sm sin/?)- 2tn/3sinq 

nm L V / J 








m = fr+a' + tr--n(r-cr-+fli>-a'p') 


a Re 
~d~p" 


= A nftia 2 - p 2 -t 2 )cosq - pm cos (^m sin p)+ ntia 2 + p 

nm \- • ' v 


+ t 2 


)sin ? ] 




n = yja 2 - fi 1 


BRe 
dS 


= w\ ncosiSmsin p\ — mcos q 






1 Tip 


P l""* 8 ?+<?-/? 


aim 

da 

aim 


= — nice 2 - 1 + t 2 )sinq + m* sin(<Jmsin p\- 2tnj3 cos q 

nm <- J 

= - nfiifi 2 -a 2 + t 2 )smq-fim 3 sin( Smsin p)+nt (a 2 + 1 

nm' L v 


+ r 2 


cos ? ] 


q = p — 8msinp 


aim 

~dS~' 


= w\ n sin (Sm sin p)+ msinq | 









d = CT(-Y) (M 2 + t 2 ) Y/2 sin (y arctan -| j 



2\172. 



H-CT(-F) (G 2 +i 2 ) ' sin ( Y arctan — 



Moreover, if we assume f(X) = X, the mean, the variance, the skewness and 
the kurtosis of CGMY distributions are given by: 

m f (C, G, M,Y) = C (M Y - X - G Y ~ X ) T(l - Y); 
aj(C, G, M,Y) = C (M Y - 2 + G Y - 2 ) T(2 - Y); 
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C (M Y - 3 - G Y - 3 ) r(3 - Y) 
a f (C, G, M, Y) = ^ LA >■ 

{C{M Y - 2 +G Y - 2 )T{2-Y)) i/2 

C (M Y - 4 + G Y - A ) T(4 - Y) 
k f (C,G,M,Y) = 3+ i J — L. 

J ' (C(M Y - 2 + G Y - 2 )T(2-Y)) 2 

The derivatives ^M, ^^ 9Im $* (t)) and 0Rc ^ (t)) are given in Tables I 
and II. 



3 An empirical comparison between the GT mo- 
ment estimator and the maximum likelihood 
estimator of stable Paretian distributions 

In this section we compare the GT moment estimator and the MLE obtained by 
inverting the characteristic function of stable Paretian distributions (see Rachev 
and Mittnik (2000)). In particular, we test the above semi-parametric estimator 
for stable distributions using simulated data. Therefore, using the algorithm 
proposed by Chambers et al. (see Chambers et al. (1976)), we generate N 
(JV=200,. . . ,5000 with step 100) stable distributions S a (a, /?,//) with parame- 
ters a = 0.51, 0.76, 1.26, 1.51, 1.76; /3 = -1, -0.5, 0.5, 1; a = 1; fj, = 1. Then we 
estimate the parameters on the simulated data (for each TV) considering both 
estimating methods: the GT moment estimator, and MLE valued inverting the 
characteristic function with the FFT. As starting point for the GT moment 
and MLE estimators we use the parameters obtained estimating the series of 
N elements with the McCulloch quantile method (see, among others, Rachev 
and Mittnik (2000)). We obtain the results of GT moment minimizing the sum 
of the asymptotic variances subject to the usual constraints. This computation 
requires less time than the MLE approximation. Minimizing the determinant of 
the asymptotic variance matrix we get more robust results, but we need much 
more computational time to approximate the parameters. 

We measure (on average) the absolute value of the percentage of the dis- 
tance between the parameters of simulated series and the estimated ones for the 
different a ,a,/3, and fi i.e., we compute the average (varying N) of \A9gt\ = 



0GT-0 s 



and similarly of \A9mle\ 



s-9s 



where 6 = a, 



a, (3, /i. These results are given in Table IV where we remark in bold the best 
approximations. We observe that the sum of the all absolute errors is higher for 
the MLE. In addition, the empirical analysis shows that the GT moment esti- 
mators present a very good performance even in comparison to those obtained 
with the MLE method. 



4 Concluding remarks 

In the paper we discussed the application of the estimating function method to 
value the parameters of distributions defined only by their characteristic func- 
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Table IV 



This table summarizes the average of the absolute errors we have using either a GT moment estimator or a MLE 
estimator for different values of a (a = 0.51; 0.76; 1.26; 1.51; 1.76 ) and j3 ( J3 = -1; -0.5; 0.5; 1 ) and <j =1; // = 1. 
Moment 





Beta 


-1 






-0.5 






0.5 












Alpha 


M 


Acr| 


M 


M 


M 


Act 


M 


M 


\Aa\ 


M 


|A/?| 


M 


M 


Act 


m 


\A M \ 


0.51 


0.07 


0.28 


0.10 


0.52 


0.04 


0.09 


0.07 


0.12 


0.03 


0.11 


0.09 


0.10 


0.04 


0.09 


0.01 


0.28 


0.76 
1.26 


0.04 

0.04 


0.06 

0.04 


0.01 
0.04 


0.58 

0.43 


0.04 

0.03 


0.07 

0.03 


0.08 
0.09 


0.24 
0.19 


0.04 

0.03 


0.08 

0.03 


0.08 
0.08 


0.21 

0.20 


0.04 

0.04 


0.05 

0.03 


0.01 

0.03 


0.52 

0.40 


1.51 
1.76 


0.03 
0.03 


0.03 
0.02 


O.OS 

0.09 


0.12 
0.09 


0.03 
0.03 


0.02 

0.03 


0.15 
0.36 


0.07 
0.04 


0.03 
0.03 


0.02 
0.02 


0.14 
0.31 


0.06 
0.04 


0.04 
0.05 


0.03 
0.05 


0.05 
0.07 


0.13 
0.14 



MLE 


































Beta 


-1 






-0.5 






0.5 






1 






Alpha 


M 


A<r| 


M 


M 


M 


A<7 


M 


M 


\Aa\ 


M 


|A/?| 


M 


M 


A<x 


M 


M 


0.51 


0.02 


0.20 


0.62 


0.47 


0.03 


0.25 


0.79 


0.25 


0.03 


0.21 


0.09 


0.05 


0.02 


0.15 


0.02 


0.04 


0.76 


0.23 


0.31 


0.19 


1.54 


0.13 


0.10 


0.50 


0.78 


0.16 


0.13 


0.14 


0.57 


0.23 


0.31 


0.19 


1.54 


1.26 


0.02 


0.02 


0.07 


0.33 


0.02 


0.02 


0.16 


0.24 


0.02 


0.02 


0.08 


0.16 


0.03 


0.02 


0.01 


0.28 


1.51 


0.02 


0.02 


0.06 


0.09 


0.02 


0.02 


0.12 


0.06 


0.02 


0.02 


0.11 


0.05 


0.02 


0.01 


0.00 


0.06 


1.76 


0.02 


0.01 


0.05 


0.04 


0.02 


0.02 


0.18 


0.04 


0.02 


0.02 


0.16 


0.04 


0.02 


0.01 


0.00 


0.04 



tions. The proposed methodology showed good versatility, since it could be ap- 
plied to any bounded function of the underlying random variable. In particular, 
we propose two EF estimators for the parameters of stable Paretian distribu- 
tions and other infinitely divisible distributions. Finally we have proposed an 
empirical comparison based on simulated data of stable Paretian distributions. 
The good results obtained with the EF moment estimator even with respect to 
the MLE method, suggest that probably we could make further improvements 
in parameter estimation using others bounded functions. 
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Abstract 

In this paper, the complete parameterization of the length ten wavelets 
is given with no parameter constraints. Among this class, examples of 
"frequency selective" wavelets (subband filters) are highlighted that per- 
form comparable to the FBI 9/7 filter as applied to image compression. 
Specifically, the parameterization is of the dilation equation coefficients for 
all trigonometric polynomials m(u>) which satisfy the necessary conditions 
for orthogonality, that is m(0) = 1 and |m(/j)| 2 + \m(u) + tt)\ 2 = 1, but 
with no restriction on the number of vanishing moments except the zeroth 
which is part of the necessary conditions. Moreover, specific parameters 
are given that correspond to the Daubechies wavelets, and a frequency 
response comparison is given showing that there are "flatter/steeper" fre- 
quency responses than the Daubechies wavelets but with fewer vanishing 
moments. We conclude with an image compression scheme to compare 
the standard wavelets, the FBI 9/7 biorthogonal wavelets with symmetric 
boundaries, and some "frequency selective" parameterized wavelets. 

Keywords: Wavelets, coefficients, orthogonal, parameterization 

1 Introduction 

Typically, when someone chooses to use a wavelet in an application, they use 
the default choices of the standard Daubechies wavelets [3] or the biorthogonal 
wavelets such as the FBI 9/7 filter [1]. Unfortunately, the wavelet needed for 
most applications is highly dependent on the data being transformed by the 
wavelet. In this paper, we give a parameterization for the length ten orthogonal 
wavelets which allows one to consider a whole continuum of scaling functions 
that vary from their number of vanishing moments to their regularity. The 
length of a scaling function is referring directly to the number of nonzero dila- 
tion equation coefficients which is one larger than the degree of the associated 
trigonometric polynomial. The length also refers to the nonzero support of the 
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scaling function. For instance, the length ten scaling functions have ten dila- 
tion coefficients, an associated trigonometric polynomial of degree nine, and is 
supported on the interval [0,9]. The standard Daubechies' wavelets were con- 
structed using a spectral factorization method in such a way that they have 
minimal length, maximal number of vanishing moments, and minimal phase. 
For example, the Daubechies' wavelets of lengths two, four, six, eight, and ten 
with minimal phase each have one, two, three, four, and five vanishing moments 
respectively. Vanishing moments allow polynomial data within the signal to be 
well approximated. In [4], Lai and R. constructed explicit parameterizations of 
all the univariate orthogonal scaling functions of lengths four, six, eight, and 
ten, but require one to solve a transcendental parameter constraint for both 
lengths eight and ten. This current paper removes this parameter constraint for 
the length ten case (refer to [8] for the unconstrained length eight parameteri- 
zation). 

Other researchers have investigated the parameterization of orthogonal wavelets 
(see [14]). It appears that Schneid and Pittner [12] were the first to give formu- 
las that would lead to the explicit parameterizations for the class of finite length 
orthogonal scaling functions after finding the Kronecker product of some ma- 
trices for wavelet lengths of two through ten. Colella and Heil investigated the 
length four parameterization in [2] . Others have constructed parameterizations 
for biorthogonal wavelets as well as multiwavelets (see [11] and [9]). Recently, 
Regensburger, in [7], constructed the explicit parameterizations for the orthog- 
onal scaling functions with multiple vanishing moments up to length ten by first 
solving the linear system of equations that result from the vanishing moment 
conditions and then solving the necessary condition for orthogonality. 

This current paper improves on the parameterizations of the past in that 
the number of vanishing moments is unrestricted and the parameterizations 
are explicitly stated with no parameter constraints for the class of length ten 
wavelets. 



2 Necessary Conditions for Orthogonality 

The following necessary conditions for orthogonality are well known in the lit- 
erature (see [3], [5], and others). Consider a scaling function 4> that satisfies the 
dilation equation 

N 

4>(x) = y hk4>(2x — k) 

fe=0 

and its associated trigonometric polynomial m of degree N which is given by 

N 
fe=0 

It is well known , that m can be written as an infinite product. In order for this 
product to converge, m must not vanish at the origin, i.e. m(0) = c ^0. This 
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condition immediately implies, where we choose the normalization c = 1, that 

JV 



E^ = L (^ 



fc=0 

Moreover, the necessary condition for the orthogonality of <fi with its integer 
shifts is given by 

|toH| 2 + |m(w + tt)| 2 = 1. (2) 

This condition is equivalent to the dilation coefficients satisfying a system of 
nonlinear equations, specifically 

N -*i i N _ i 

^ ftfe/ife+2j = ^^O')) 3 = °. • • • . — j^— 

fc=0 

where (5(0) = 1 and <5(j) = for j ^ 0. 

For the length ten case(iV = 9)that we are considering currently, we have 
the following underdetermined nonlinear system: 



ho + h{ + hi + h£ + h\ + hi, + h% + h 7 + h% + hi, = - 

h h 2 + hihj, + h 2 h 4 + h 3 h 5 + h 4 h 6 + h 5 h 7 + +h§h$ + h 7 h 9 = 

h n h 4 + hxh 5 + h 2 h 6 + h 3 h 7 + h 4 h$ + h 5 h 9 = 

h h e + h\h 7 + h 2 h$ + h 3 h 9 = 

hoh$ + hihg = 0. 

Additionally, these two conditions (1) and (2) imply the zeroth vanishing mo- 
ment condition m(n) — or equivalently the linear equations 

1 
2' 

Because of the strong patterns between the odd and even indices for the dila- 
tion coefficients, it is useful to relabel the dilation equation coefficients in the 
following fashion 



(JV-l)/2 


(JV-l)/2 


y^ h 2k = 


2 , h 2 k+i 


fc=0 


fc=0 



m(w) = Y^ a k e 2klu} + b k e { 



,(2k+l)»u; 

fe=0 

where we let n = (N—l)/2. Note, since there are no odd length scaling functions 
satisfying the necessary condition for orthogonality, N will always be an odd 
integer. 

As a means of summary with our new notation, we conclude with the fol- 
lowing statements. Given a scaling function <fr and its associated trigonometric 
polynomial 

n 

m(w) = ]T a k e 2k ^ + b k e ( - 2k + 1 ^ 

fe=0 



1 fift 
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of degree In + 1, the necessary condition for orthogonality, 

\m(cj)\ 2 + \m(w + n)\ 2 = 1, 
is equivalent to the following system of nonlinear equations: 

n-j 



1 
^2 akO-k+j + b k b k+] = -<5(j), j = 0, 

fc=0 

where 6(0) — 1 and 6(j) = for j ^ 0. 



3 Length Four 

Although the length four parameterization is well known (see [8, 14]), it is used 
in the construction of the length ten parameterization and is presented here for 
completeness. 

For length four (N = 3 and n — 1), the nonlinear system of equations is 

a + a x = - (3) 

bo + b, = \ (4) 

a 2 +a 2 + b 2 +b 2 = - (5) 

a ai + fe &i = 0. (6) 

Subtracting twice equation (6) from equation (5) gives 

(ao - ai) 2 + (bo - h) 2 = -. 
This equation allows the introduction of a free parameter, that is 

o-a — a-i = —j=sin6 (7) 

&o — b\ = —=cos9. (8) 

v2 

Combining equations (3) and (4) with (7) and (8) gives the length four param- 
eterization 

a Q = \ + ^in6, b a = \ + ^o.6, 

1 l ■ a u l l 



4 2V2 4 2V2 

These formulas are well known (see [14], [2], [8], and others). To aid in the con- 
struction of the longer parameterizations, a different period and phase shift are 
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chosen for the length four solution, that is 8 = 2a — 7r/4. With this substitution 
and some simplification, the length four solution can be written as 

ao = — (1 — cos 2a + sin 2a) 

&o = — (1 + cos 2a + sin 2a) 

ai = -(1 + cos 2a — sin 2a) 

b\ = -(1 — cos 2a — sin 2a) 

where this form will simplify future computations. It should be noted that 
this parameterization is a necessary representation for the coefficients and upon 
substituting them back into the system of equations (3)- (6), we see that they 
are also sufficient. 



4 Length Ten 

For the construction of parameterizations for length six and eight see [8] . For 
length ten (N — 9 and n = 4) , the nonlinear system of equations is given by 

(9) 
(10) 

(11) 
a ai + aia 2 + a 2 a 3 + a 3 a 4 + 6 &i + &1&2 + &2&3 + ^3^4 = : (12) 

(13) 
(14) 
(15) 

An important step in the construction is establishing the connection between 
the sums of the even and odd indexed coefficients back to the length four pa- 
rameterization. More specifically, the sums ao + a 2 + 0,4,, ai + 03, bo + b 2 + &4, 
and 61 + 63 satisfy the system of equations associated with the length four pa- 
rameterization, i.e. 



ao + ai + a 2 + a 3 + 04 - 


1 
2 


b + bx + b 2 + h + &4 = 


1 
2 


a 2 a + a\ + a\ + a\ + a\ + b 2 Q + b\ + b\ + b\ + b\ = 


1 
2 


aia 2 + a 2 a,3 + 0,30,4 + b bi + bib 2 + b 2 b 3 + 6364 = 


= 


a a a 2 + aia 3 + a 2 a A + b b 2 + &1&3 + b 2 b A = 


= 


aod3 + a-La,4 + bob3 + bibi = 


= 


0004 + &0&4 = 


= 0. 



1 

2 

1 

2 

1 

2 
(ao + a 2 + 04)(oi +03) + (60 + 62 + b4.)(bi + 63) = 0. 



(ao + a 2 + a 4 ) + (ai + a 3 ) 
(60 + b 2 + 64) + (61 + 63) 

(a + a 2 + a 4 ) 2 + (01 + a 3 ) 2 + (b + b 2 + b 4 ) 2 + (b x + b 3 ) 2 
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The third equation is equivalent to the sum of equations (11) and (13), and the 
last one is equivalent to the sum of equations (12) and (14). Therefore, we can 
use the length four parameterization for these sums, i.e. 



a - 
b 



0-2 + «4 = 


-(1 — cos 2a + sin 2a) 


b 2 + b 4 = 


-(1 + cos 2a + sin 2a) 


oi + a 3 = 


= -(1 + cos 2a — sin 2a) 

4 V ' 


h+b 3 = 


= — (1 — cos 2a — sin 2a) 



In an effort to linearize the system of equations, note that the sum and difference 
of equation (11) and twice equation (15) give the two equations: 



(ao + a 4 ) 2 + (6o + h) 2 -- 


1 

2 " 


-a 2 - 


-a\ - 


2 l2 

- a 3 - b x - 


-bl- 


-b\ 


(16) 


(a - a 4 ) 2 + (b - bi) 2 - 


1 
2 " 


-a 2 - 


-a\ - 


-a\-b\- 


-bl- 


-b\ :- 


= P 2 - (17) 



Although the right hand side, p 2 , has not yet been determined, we use the fact 
that the right-hand sides of equations (16) and (17) are equivalent and introduce 
two new free parameters /3 and 7 in the following fashion: 



a + 04 = 


= p cos (3 


b Q + bi -- 


= p sin j3 


ao — 04 = 


= p cos 7 


bo - bi - 


= psm.7. 



There are now 8 linear equations and ten unknowns. For the last two equa- 
tions, we introduce two additional free parameters q and r for the differences 
between the odd indices and rotate them using an orthogonal matrix which 
depends on the parameter 7, i.e., 

01—03 = q cos 7 — r sin 7 
&i — 63 = gsin7 + rcos7. 

The decision to rotate these free parameters by the orthogonal matrix depending 
on 7 was based on the nonlinear relationship in equation (14) which involves 
cos 7 and sin 7. 

Now, solving the linear system of equations involving the free parameters 
a,/3,7,p, o, and r which necessarily satisfy the nonlinear system of equations 
yields 
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ao = — (cos j3 + cos 7) 
b = - (sin (3 + sin 7) 

ai = -(1 + cos 2a — sin 2a) H — (q cos 7 — r sin 7) 
8 2 

&i = — (1 — cos2a — sin2a) + — (gsin7 + r COS7) 
8 2 

<i2 = — (1 — cos 2a + sin 2a) — pcos/3 
b 2 = -7(1 + cos 2a + sin 2a) — psin/3 

03 = — (1 + cos 2a — sin 2a) — — (qcosj — r sin 7) 

8 2 

bs = — (1 — cos 2a — sin 2a) — -(gsin7 + r cos 7) 
8 2 

04 = - (cos /3 — cos 7) 

P 
64 = — (sin/3 — sin 7). 

As far as the nonlinear equations, notice that equation (15) is satisfied im- 
mediately. Now plugging the linear solutions into equation (14), we get 

a a 3 + aia 4 + b b 3 + 6164 = 

-(-4q + cos/? + sin/3 + cos(2a + /3) - sin(2a + /?)) = 
8 

and since p = would result in a constrained solution, we have the parameter- 
ization of q as 

q = -(cos/3 + sin/3 + cos(2a + (3) — sin(2a + /?)) 

= — (cos a — sin a) cos(a + /3). 

Using this value of q and substituting the linear equation solutions into equation 
(12), we see that equation (12) is now satisfied while equation (13) produces a 
quadratic equation in terms of the unknown p, that is 

a a a 2 + aia 3 + a 2 a A + b Q b 2 + 6163 + b 2 b A = 

p (cos a + sin a) sin(a + j3)p + ■ ■ ■ 

— (16r 2 + 2cos(2(a + /?)) - sin(2(a + /?)) + 2 sin 2a + sin 2/3 - 2) = 

which has a solution in terms of the free parameter r of 

p = — I (cos a + sin a) sin(a + /3) ± \f\ — 4r 2 — cos(2(a + /3)) ) 
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with a constraint on the size of the parameter r of 

r 2 < l-cos(2(a + /3)) 
- 4 

= isin 2 (a + /3) (19) 

that is necessary to keep the parameter p real. So, we introduce the free pa- 
rameter 5 for r that satisfies the inequality (18), i.e. 

r = —= cos <5 sin(a + 0). 

V2 

After substituting this parameterization of r back into the expression for p, we 
have 



P 



— sin(a + 0) ( cos a + sin a — v2 sin 5 



Substituting these parameterizations of p, q, and r back into the nonlinear sys- 
tem, shows that in fact these necessary conditions on the parameters are also 
sufficient. Therefore the complete parameterization of all coefficient vectors of 
length ten that satisfy the necessary conditions for orthogonality can be param- 
eterized by the following: 



sin S 



p = — sin(a + 0) I cos a + sin a — v2 

q = — (cos a — sin a) cos(a + 0) 

r = — — cos<5sin(a + 0) 
v2 
p 
ao = — (cos + cos 7) 

p 

60 = -(sin/3 + sin7) 

ai = — (1 + cos 2a — sin 2a) + -(qcos-y — rshi7) 
8 2 

61 = — (1 — cos 2a — sin 2a) + -(qsin7 + r cos 7) 

8 2 

<i2 = — (1 — cos 2a + sin 2a) — pcos0 

62 = 7 (1 + cos 2a + sin 2a) — psin/3 

03 = — (1 + cos 2a — sin 2a) — — (qcosj — r sin 7) 

8 2 

63 = — (1 — cos 2a — sin 2a) — — (<7sin7 + r cos 7) 

8 2 

04 = — (cos — cos 7) 

p 

64 = — (sin — sin 7) 
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Wavelet 


a 





7 


8 


Haar 


-2.35619449 


-2.35619449 


-2.35619449 


1.57079633 


D4 


-2.61799388 


-2.09439510 


-2.09439510 


1.30899694 


D6 


-2.85603576 


-1.96184952 


-1.96184952 


1.07095506 


D8 


-3.08353313 


-1.88256239 


-1.88256239 


0.93539307 


D10 


-3.30496928 


-1.80914574 


-1.85080961 


0.88683222 


OJ10 


-3.27396878 


-1.82536723 


-1.87718786 


0.93186197 


T10 


0.71389116 


-2.86392827 


1.66872168 


2.43807534 


MF10 


0.74143253 


-2.94548381 


1.51834616 


2.20225600 



Table 1: Parameters associated with the standard Daubechies wavelets and 
some other length ten wavelets. 



which is wcll-dchned for any parameters a, /3, 7, and 5 € IR. This parameter- 
ization contains all of the standard Daubechies wavelets of length ten or less 
where the parameters needed for each are given in Table 1 along with some 
other wavelets which will be used in the image compression comparison. Figure 
1 shows the scaling functions associated with the parameters chosen in Table 1. 



5 Frequency Selective Wavelets and Image Com- 
pression 

In Daubechies seminal work [3], an emphasis is placed on discrete orthogonal 
wavelets which have a maximum number of vanishing moments. In that work, 
an example is put forth where one of the vanishing moments is moved away from 
u> = —it towards — ir/2 and the comment is made that this wavelet is much closer 
to a "realistic" subband coding filter (see pg. 201 in [3]). In a later work, Ojanen 
in [6] develops a scheme to find the maximal regularity for the scaling functions 
based on moving a finite number vanishing moments towards the center. Ojanen 
lists one optimal length 10 wavelet which we will call OJ10(see Table 1 for 
the necessary parameters). The frequency response graphs in Figure 2 and 3 
illustrate how the filters will treat various frequencies in the range [— tt,tt]. An 
"ideal" filter (which can not be implemented with a finite convolution) has a 
frequency response that is one on the interval [— 7r/2,7r/2] and zero otherwise. 
In Figure 3b, notice that OJ10 uses a zero near -2.6 to improve the "flatness" 
in this area and subsequently giving it a steeper transition. In the literature, 
band-pass filters that model the "ideal" filter are called "frequency selective". 
T10 and MF10 both have only the zeroth vanishing moment, but MF10 keeps a 
shallow profile in the first part of the interval and achieves a steeper transition 
than D10 or OJ10. T10 is highlighted here because of its extreme steepness at 
the transition compared to the others, but sacrifices flatness in the first part of 
the interval. 
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(a) -% L 



(c) 



(e) 



2 3 4 5 





2 3 4 5 



(b) 




23456789 



(d) 



(f) 



(g) ° 




(h) 



23456789 




2 3 4 5 6 




2 3 4 5 




Figure 1: Scaling function plots for the specific parameters in Table 1. (a) Haar, 
(b) D4, (c) D6, (d) D8, (e) D10, (f) OJ10, (g) T10, and (h) MF10. 
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(a) 



(b) 




(c) 






(d) 

Figure 2: Frequency response plots |m(w)| for the scaling functions from Table 
1. (a) Haar, D4, D6, D8, and, D10 (b) zoomed in plot of previous, (c) D10, 
T10, OJ10, and MF10, (d) zoomed in plot of previous. 



176 



Frequency selective parameterized wavelets of length ten 









Row= 


=469 








Wavelet 


Barb 


Boat 


Lena 


Marm 


Bark 


Fing 


Sand 


D10 


3034 


1052 


820 


73 


2973 


1676 


1943 


OJ10 


3039 


1050 


813 


73 


2946 


1675 


1954 


T10 


2822 


957 


880 


95 


2610 


1670 


1993 


MF10 


2793 


960 


846 


82 


2640 


1669 


1959 


FBI 9/7 


2900 


976 


798 


58 


2795 


1668 


1825 








Row= 


=191 








Wavelet 


Barb 


Boat 


Lena 


Marm 


Bark 


Fing 


Sand 


D10 


2064 


1651 


951 


113 


2384 


1544 


2317 


OJ10 


2029 


1665 


943 


105 


2358 


1532 


2311 


T10 


2409 


1603 


986 


256 


2210 


1529 


2255 


MF10 


2379 


1603 


948 


163 


2188 


1532 


2241 


FBI 9/7 


2166 


1593 


880 


106 


2198 


1548 


2090 








Row= 


=287 








Wavelet 


Barb 


Boat 


Lena 


Marm 


Bark 


Fing 


Sand 


D10 


2818 


1799 


929 


76 


2407 


1629 


1972 


OJ10 


2799 


1785 


917 


70 


2398 


1616 


1960 


T10 


3163 


1829 


946 


184 


2272 


1555 


2126 


MF10 


3172 


1826 


941 


116 


2259 


1568 


2099 


FBI 9/7 


2856 


1756 


854 


90 


2182 


1634 


1911 



Table 2: The t\ norm of the wavelet coefficients for a one-level decomposition 
of a single row from some test images. The lowest i\ norm has been boldfaced 
for convenience. 



In our first numerical experiment, we compare the four length-ten filters D10, 
OJ10, T10, and MF10 with the biorthogonal FBI 9/7 wavelet with a single level 
decomposition of a row from our test images and compute the i\ norm of the 
wavelet coefficients. A lower i\ norm would suggest a more efficient representa- 
tion for the vector. We chose three random rows and tested all five wavelets on 
seven different images using the specified row. As can be seen in Table 2, the 
9/7 filter appears to do better more than half of the time. The lowest i \ norm 
has been boldfaced for each image. The results are not conclusive, but are still 
interesting. In a second more comprehensive numerical experiment, we present 
an image compression comparison for the wavelets D10, OJ10, T10, MF10, and 
the FBI biorthogonal 9/7 wavelet with symmetric boundary extensions. Be- 
cause of its common use as an industry standard and similar length, we chose 
to include the biorthogonal 9/7 wavelet in the comparison. The 9/7 biorthogo- 
nal wavelet has one advantage over the the length ten parameterization in that 
it is symmetric. This symmetry can be used to improve the performance of 
image compression at the image boundaries. We have included this advantage 
in our numerical results. The details of the numerical experiment are as follows: 
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(e) 







(f) 



Figure 3: The seven test images used in the compression scheme (512 x 512 
grayscale images): (a) Barb, (b) Boat, (c) Lena, (d) Marm, (e) Bark, (f) Fing, 
and (g) Sand. 
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Wavelet 


Barb 


Boat 


Lena 


Marm 


Bark 


Fing 


Sand 


D10 


26.29 


28.74 


32.30 


34.78 


21.27 


30.14 


23.27 


OJIO 


26.34 


28.76 


32.32 


34.93 


21.31 


30.18 


23.28 


T10 


26.43 


28.59 


32.11 


35.01 


21.41 


30.35 


23.42 


MF10 


26.43 


28.69 


32.43 


35.57 


21.44 


30.28 


23.44 


FBI9/7 


26.30 


29.22 


32.81 


35.11 


21.43 


30.14 


23.46 



Table 3: PSNR results for the seven images Barb, Boat, Lena, Marm, Bark, 
Fing, and Sand using the wavelets D10, OJ10, T10, MF10, and the FBI 
biorthogonal 9/7. The best PSNR for each image is boldfaced. 



• Seven level decomposition with periodic boundaries (except for 9/7 which 
has symmetric extensions) using D10, OJ10, T10, MF10, and 9/7 FBI. 
Note: All of the scaling functions presented in this paper need to be 
normalized by a multiple of \/2 during implementation. 

• Embedded Zero-tree (EZW) compression (see [13] and [10]) with a file size 
ratio of 32:1. For this experiment, all of the images are 512 x 512 with a 
PGM file-size of 256Kb and a compressed file-size of 8Kb. This particular 
EZW implementation is not completely optimized and would not neces- 
sarily yield the maximum PSNR possible but serves well as a comparative 
measure of the true compressibility of the wavelet decomposition. 

• Seven level reconstruction followed by a Peak Signal to Noise Ratio (PSNR) , 
i.e. 



RMSE = 



\ 



512 512 



i=l i=l 

/ 255 \ 
PSNR = m °^{RMSE) 



— A- -I 2 



where Ai j is the original matrix of grayscale values and Aij is the com- 
pressed version. 

The results from the experiment are given in Table 3. 
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Abstract 

We address in the present paper the following problem : is the linear 
structure the only one which allows us to construct approximation oper- 
ators ? As an answer to this problem we propose new, so-called pseudo- 
linear approximation operators, which are defined in a max-product al- 
gebra. We consider a nonlinear (max-product) Bernstein operator and a 
nonlinear (max-product) Favard-Szasz-Mirakjan operator. We prove that 
the approximation errors given by these operators are similar to those 
of the corresponding linear operators provided by the classical Approx- 
imation Theory. Also, by some graphs we illustrate that the nonlinear 
operators can better reproduce the shape of some continuous functions 
(which are not differentiable or have abrupt changes of the values in some 
points) than their linear counterparts. 

1 Introduction 

The linear approximation operators provided by the classical Approximation 
Theory use exclusively as underlying algebraic structure the linear space struc- 
ture of the reals. In the present paper we address the following problem : is 
the linear structure the only one which allows us to construct approximation 
operators ? 
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This problem was proposed in [1], [2] for Shepard operators. The findings in 
these two papers were that besides the linear structure we may opt for different 
structures as max-product, max-min (fuzzy) algebras or semirings with gener- 
ated pseudo-operations. In this sense, Shepard-type operators of Max-Product 
and Max-Min kinds were studied together with operators of Shepard-type based 
on generated pseudo-operations. Note that these operators are nonlinear (they 
are so-called pseudo- linear) . So, the topic of these papers and the present one 
partially fits on the area of Nonlinear Approximation, as it is described by e.g. 
[3] , [5] . It is only a partial fit on this area, because while the operators are non- 
linear, the algebraic structure underlying them is still a structure of linear space. 
In [3] and [5] , the nonlinearity comes from the construction of approximations 
based on a dictionary of functions. Because of this reason, the method is not 
fully constructive. In contrast, the approach presented here is fully constructive 
and quite simple. 

Recently, the monograph [4] (see pp. 324-326, Open Problem 5.5.4) brings 
into attention the same question of constructing approximation operators with 
max-product or max-min operations, but in a more systematic way. Thus, 
following [4] we consider the nonlinear Bernstein operator of max-product kind 

n 

\J Pn,k{x) 

k=0 

with p n ,k{x) — (?)x (1 — x) n ~ k and the nonlinear Favard-Szasz-Mirakjan oper- 
ator of max-product kind 

oo 

V *#/(£) 

Fi M \f)(x)= k -^ . 

V{nx) k 
~kl~ 
fe=0 

Our findings are that these operators have very similar properties to the cor- 
responding linear operators provided by the classical Approximation Theory. In 
this sense we show that the error estimates in approximation by these operators 

is of order O 



Wi 



(*£) 



Some experimental results are also discussed and 
our finding is that the proposed max-product operators can better follow abrupt 
changes in the target function than their classical linear counterparts. 

2 Nonlinear approximation operators 

Over the set of positive reals, K+, we consider the operations V (maximum) and • 
product. Then (R+, V, •) has a semiring structure and we call it as Max-Product 
algebra. 
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Let / C R be a bounded or unbounded interval, and 

CB + (I) = {/:!—> R + ; f continuous and bounded on I}. 

The general discrete form of a max-product approximation operator L n : CB + (I) 
CB + (I), is 



L n (f)(x) = \J ' K n {x,Xi) ■ f(xi), 



i=0 

or 

oo 
L n {.f){x) = \J K n {x,Xi) ■ f(xi), 
i=0 

where n G N, / G CB + (I), K n (-,Xi) G CB + (I) and x 4 G J, for all i. These 
operators are nonlinear, positive operators and moreover they satisfy a pseudo- 
linearity condition of the form 

L n (a ■ / V p ■ g)(x) = a ■ L n {f){x) V [3 ■ L n (g)(x),Va, (i e R+,f,g :/^H+. 

In this section we present some general results on positive nonlinear opera- 
tors. These are used later in the study of nonlinear Bernstein and Favard-Szasz- 
Mirakjan operators of max-product kind. 

Lemma 1 Let I C M be a bounded or unbounded interval, 

CB + {I) = {/:/—> M + ; f continuous and bounded on /}, 

and L n : CB + (I) — > CB + (I), n G N be a sequence of operators satisfying the 
following properties : 

(i) if f, 9 G CB+(I) satisfy f < g then L n (f) < L n (g) for all n G N ; 

(ii) L n (f + g)< L n (f) + L n (g) for all f,g G CB+(I). 

Then for all /, g G CB + (I), n G ./V and x G I we have 

\L n (f)(x) - L n (g)(x)\ < L n (\f - g\)(x). 

Proof. Let /, g G CB + (I). We have / = / - g + g < \f - g\ + g, which by the 
conditions (i) — (ii) successively implies L n (f)(x) < L n (\f — g\)(x) + L n (g)(x), 
that is L n (f)(x) - L n (g)(x) < L n (\f - g\)(x). 

Writing now g = g — / + / < \.f — g\ + / and applying the above reasonings, 
it follows L n (g)(x) — L n (f)(x) < L n (\f — g\)(x), which combined with the above 
inequality gives \L n (f)(x) - L n (g)(x)\ < L n (\f - g\)(x). m 

Remark 2 1) It is easy to see that our max-product operators satisfy the con- 
ditions in Lemma 1, (i), (ii). In fact, instead of (i) they satisfy the stronger 
condition 

Ln(f V g)(x) = L n {f){x) V L n (g)(x), /, g G CB+(I). 

Indeed, taking in the above equality / < g, f, g G CB + (I), it easily follows 
L n (f)(x)<L n (g)(x). 

2) In addition, it is immediate that the max-product operators are positive 
homogenous, that is L n (Xf) = \L n (f) for all X > 0. 
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Corollary 3 Let L n : CB + (I) — ► CB + (I), n £ N be a sequence of opera- 
tors satisfying the conditions (i)-(ii) in Lemma 1 and in addition being positive 
homogenous. Then for all f € CB + {L), n <G N and x <G / we have 



\f(x)-L n (x)\< 



L n (ip x )(x) + L n (e )(x) 



vi(f\8)i + f(x)-\L n (e )(x)-l\, 



where S > 0, eo(t) — 1 for all t e I , (p x (t) = \t — x\ for all t <G / . x e / and 
wi(/;<5)j = max{|/(x) - f{y)\;x,y e I, \x-y\ < S}. 

Proof. The proof is identical with that for positive linear operators. Indeed, 
from the identity 

L n (f)(x) - f{x) - [i„(/)(x) - f{x) ■ L n (e )(x)} + f(x)[L n (e )(x) - 1], 

it follows (by the positive homogeneity and by Lemma 1) 

|/(:r) - L n {f){x)\ < \L n ifix))ix) - L n {f{t)){x)\ + \f(x)\ ■ \L n (e a )(x) - 1| < 

Ln(\f(t) - f(x)\)ix) + \fix)\ ■ \L n (e )(x) - 1|. 
Now, since for all t, x £ I we have 

|/(i)-/(aOI<wi(/;|*-a:|)i< ;|/-.»-| + ] 



wi(/;*)/, 



replacing above we immediately obtain the estimate in the statement. ■ 
An immediate consequence of Corollary 3 is the following. 

Corollary 4 Suppose that in addition to the conditions in Corollary 3, the 
sequence (L n ) n satisfies L„(eo) = &o, for all n G N . Then for all f € CB + {I), 
n € N and x G / we have 



\f(x)-L n (x)\< 



1 + -L n {ip x ){x) 



ui(f;S)i- 



Remark 5 The max-product operators that we will construct in the following 
sections will satisfy the additional condition in Corollary 3, so that Corollary 3 
one applies to these kind of operators. 



3 Nonlinear Bernstein Operator 

Let / : [0, 1] — > M+ be continuous. We consider the following nonlinear Bernstein 
operator of max-product type 



Bi M \f)(x) = ^ 

\/ 'Pn,k( X ) 



(1) 



k=0 
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with 

i—k 



Pn,k{x)= (Vjx k {l-Xy 



We provide an error estimate for this approximation operator in terms of the 
modulus of continuity. 

Theorem 6 The following pointwise estimate holds true for the nonlinear Bern- 
stein operator 

\Bi M \f)(x) - f(x)\ < Clo, ( /, ^ x{1 ~ x) +-) , x e [0, l],n e N, 

V ^ n / [0 ,i] 

where C > is an absolute constant independent of f, n and x. 

wi(/,*)[o,i] =sup{|/(»)-/(i/)|;a;,yG [0,l],|»-y| < <5}. 

Proof. It is easy to check that the max-product Bernstein operators fulfil the 
conditions in Corollary 4 and we have 

\Bi M \f){x) f{x)\ <{l+ i-sW^Xx)) Wl (/, 5„) [0ll] , 

where (p x (t) = |t — x\. So, it is enough to estimate 

n 
\J Pn,k{ X ) \n~ X \ 

4 M) (^)(*) = ^r 



V Pn,k( X ) 



fe=0 



First we calculate \J p n k(x) for fixed n,x. Let E n ^(x) = p n .k+i(x)—p n ,k(x), 



fc=0 

fc € {0, ...,n — 1}. We have by successive calculations 

E n Mx) = ( k n +i y k+1 (l-x) n - k - 1 -(fyx k (l-x) n - k 

fcj^ 1 -^ fc + i • 

We have nx — k — 1 + .t > if and only if E n ^(x) > 0, i.e., p n .k+\{x) > p n> k{x). 
Further nx — fc — 1 + x > if k < no; — (1 — cc), i.e., k < [(n+ l)x) — 1. It follows 



that \J 'p nk (x) = p, hr (x) = (")x r (l - x) n r , with r=[(n+ l)x] 

io 

Pn,k( X ) \n ~ X \ 



fc=0 

Now we will estimate the ratio 



, ,, , r = [{n+l)x\. 

Pn,r\% ) 
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We observe that nx < nx + x < nx + 1, so we have two cases: r = [nx] or 
r = [nx] + 1. We will present only the case r = [nx], similar result being true 
for r = [nx] + 1. 

We observe that r — [nx] implies nx — 1 < r < nx, i.e., 



r r + 1 

- < x < . 

n n 



Case 1. If fc < r then we have 

k 

n 



k r — k + 1 

x < . 

n n 



We estimate in what follows the expression 



A _ Pn, fc QE)|£-s| , 

^-n.k.r — ci j ft _i ' ■ 



Pn,r \X) 



By (3) we have 



.4 



n.k.r 



a)- fe d 



i)" fc I- - x\ 



CVa 



< 



fc!(n — fc)! \ a; 



r—k 



r\(n — r)\ fl — x\ r—k + 1 



(2) 



(3) 



1 — x ^ n—r 



Since the function — - is decreasing for a; € [0, 1] we have by (2) ^-^ < 



Further, we have 



r\{n — r)\ ( n — r\ r — k + 1 

-™-n,k,r _^ TT7 7TT I I ^r.k.n 

k\\n — k)\ \ r J n 

We will find the maximal term B r _k.n for fixed n, r and variable fc < r. We have 

1 r\{n — r)\ ( n — r^ (r — k (n — r) (r — k + 1) 



n k\(n — k — 1)! \ r J \ k 4- I 

r— fc— 1 /_ 7 \2 



r!(n — r)! (n — r 
k\{n-k-X)\ V r 



r(n — fc) 
(r - fc) 2 - r - 1 + r^±L 
r(n~k)(k+l) 



We observe that (r - k) 2 - r - 1 + ^-^ > for (r - fc) 2 > r + 1 - ^-^ i.c 



r — k > t/r + 1 — ^^^ and this holds for fc < r 
term, therefore is given by 



(r+l)(»-r) 



The maximal 



fcn = 



(r + l)(n — r) 



l = r - 



(r + l)(n — r) 
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so we have A n k r < B n ^, r < B n ^ ,r- Further, by Stirling formula we obtain 

v/27rr rV™S-^/ \n-r / \ r-fen , , 1 

^^ v e „_ r — (n-r) n r (n-r\ r - fc + l 



IT H r-fco + l -• 

fc / \n-koJ n 



Now, since fco ~ J* — 



(r+l)(n— r) 



we get 



n.ko./ 



\ 



,. _ ^ (r+l)(n-r) „ _ r + , / (r+l)(»-r) 



' l (r + l)(n-rj | \ 1 
V n / n 



We observe that 



r _ A /(E±iK^ n _ r+A /(z±iK-r) 1 + ,/J^L (2r + l)-^i 



nr 2 (n—r) 



and 



so we obtain 



nr z (n — r) y n(n — r) 



< 



Finally, we obtain 

\ V n In 

Since r = [nx] we get 



0(1). 



, _ /( <: (< j jnx + l)(n - nx) + \ 1 ^ ^ / 0,(1 - x) + 1 
n / n \ vn n 



Case 2. If fc > r then 



fc 

x 

n 



fc fc — r 

x < . 

n n 



We estimate the expression 



_ Pn, fc QE)|£-*| 
-^njKjr — / \ , ft /* / . 
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We have 



(l)x k (l-x) n - k \±-x\ rl(n-r)[ ( 1 - x\ r ~ k k - r 



C)x r (i-xy 



< 



k\(n — k)\ \ x J n 

i-x ^ n-r-i Further we 



Since the function ^—^- is decreasing we have by (2) ^—^- > 
obtain 



-™-n,k,r J^ 



r\(n — r)\ ( r + 1 



k—r 



k- 



— £>r,k,r- 



k\(n — k)\ \n — r— 1/ n 

We find, similar to Case 1, the maximal term B r k, n for fixed n, r to be obtained 
when feo = 



(r+l)(n— r) 



1 = r 



(r+l)(n— r) 



. Now we have k$ 



(r+l)(n— r) 



and so, 



£>n.ko,r 



\ 



(r + l)(n-r) \ 1 



,. _ x/ (r+l)(»-r) n _ r + , / (r+l)(»-r) \ V // 



Similar to case 1, finally we obtain 



and 



A n ,k,r S L/ 



4, r <ch (r+1)( "- r) + iU. 

n / n 



I (nx + l)(n - nx) | A 1 ^ ^ / ^(l-x) 1 

V n / n \ v^ n 



Taking into account the estimates in Cases 1 and 2 we get 



B<W(^)(*)<ch/^ + !]<-% 



Taking tf n = ^p 



- we obtain 

n 
|5i M) (/)(*) " /(*)| < O* (f, VX(l ~ X) + ~ 



[0,1] 



which completes the proof. 



Remark 7 1) The error estimate shown by the above theorem is similar to 
the error estimate for the linear Bernstein operators. However, since obviously 
B { n M) (f)(0) = /(0) and B { n M) (f)(1) = f(l), it is clear that the term 1/n in the 
estimate of Theorem 6 could be dropped. 
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2) The proof of the above theorem evidently implies the uniform estimate 



max 

a;e[0,l] 



\B { n M \f){x) f{x)\ < C lWl (f, ±) ,n G N. 

V y/nj [0; i] 



However, we conjecture that in fact the order of approximation 0[uj\{f; 1/y/n)] 
in Theorem 6 might be improved to the order (9[wi(/;ln(n)/n)]. 

4 Nonlinear Favard-Szasz-Mirakjan Operator 

Let / : [0, oo) — ► R + be continuous. We consider the following nonlinear Favard- 
Szasz- Mirakjan operator 

oo 



Fi M \f)(x) = 

V 



fc=0 

oo 

(nx) k 

fe=0 



We provide an error estimate for this approximation operator in terms of the 
modulus of continuity similarly to the case of Bernstein operators. 

Theorem 8 The following pointwise estimate holds true for the nonlinear Favard- 
Szasz-Mirakjan operator 

\F ( n M \f){x) - /(*) | < Cwx (f,^) ,x>0,ne N, 

V v«7[o,oo) 

where C > is an absolute constant (that is independent of f, n and x) and 

wi(/,<J)[o,oo) = sup{|/(a;) - f(y)\;x,y >0,\x-y\< S}. 
Proof. By Corollary 4, the error bound is controlled by the ratio 

\I^\--A 

V fc! In I 



Bi M H^)(x) - 



k=0 



oo 

(nx) k 



V 



fc! 

k=0 



oc 



(nx) k 
fe=0 

(nx) k+1 (nx) k (nx) k nx — k — 1 



First we calculate \J ^i for fixed neN,i£ [0, oo). We observe that 

fe=0 

E n . k {x) 



(fc + 1)! fc! fc! fc + 1 
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We have E n ^(x) > if and only if fc < nx — 1. It follows that \J ^i = ^ , 



with r = [ra]. 

Now we estimate the ratio 



fe=0 



V^ 



(m) I fc 



Rn,r(X) - (na;)r 

r! 



- , r = [nxj . 



Case 1. If A; < r then we have 

k 



k r — k + 1 
= x < . 



n n 



We estimate in what follows the expression 



A, 



,k,r 



(nx) k |fc 
fc! I n 



(ni) r 



, fc < r. 



As in the proof of the estimate for the Bernstein operator, we have by (3) and 

(2) 

(nx) k r! r — fc + 1 fe _ r r!r — fc + 1 

A n .k,r ^ ~ ; ; < r 



k\ (nx) r n 
Let £V,fc = r k - r £(r - k + 1). We have 



fc! n 



£r,fc+i - £ r , fc = r fc - r+1 7 ^— T (r - fe) - r*- r -(r - k + 1) 



(fc + 1)! 
fc_ r r! (r-fc) 2 -r-l 



fc! 



fc! 



fc + 1 



It is easy to check that the maximal term E rk is attained for fc = [r + 1 
\/r + l] =r- [y/r + 1]. 

By Stirling formula we obtain 



E r ,<E,, kn =r k «- r - 



rhvr 7 ^ 



^r,& _; ^r,fco 



\^^i% 



^(r-fc + i) 



„ r _ Vr+l-H 



: — — 1 e-[^+ T i ([>/F+T] + 1) 



Further we have 
E r ,k 



^^e^ r+1 (VF+1 + 1) 



y73^TT<^ T+1 >^^+ T T^+ 1 + 1 >- 
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and taking into account that 

J -(Vr + 1 + 1) < (Vr + 1 + 1) < (Vnx+l + i) 

finally we obtain 



Vnx +1 + 1 _ n (y/x 

-n-n.k.r _i — *-s I ~~ F= 

Case 2. If A; > r then we have 

(nx) k I fe _ I . , 

r! 

Further, if E r .k = (r + 1) |jj(fc — r), similar to Case 1, the maximal term is 
attained for fc = r + [\/r + 1] . 

By Stirling formula we have by successive calculations 

E r .k < E rM ~ ./ T (Vr + 1 + 1) . 

V r + vr + 1 

Further we have . / r /-x rW r + 1 + 1) < (V nx + 1 + l) and finally we obtain 



A n . k , r < V ^ TJ+1 =o(^),k>r. 



n \\ n / 

Taking into account the estimates in cases 1 and 2 we get 



Taking <L = ^ + - we obtain 



\Fi M \f){x)-f{x)\<Cu Jl [f, 

[0,oo) 



Remark 9 lj TTie error estimate shown in this theorem is similar to that for 
the linear Favard-Szdsz-Mirkjan operator. 

2) The pointwise error estimate in the above theorem obviously implies the 
uniform estimate 

max \Fi M \f)(x) - f(x)\ < Cm (f, -±=) 

xe[0,a] V V«/[0,oo) 

for any a > 0. However, we conjecture that in fact the order of approximation 
0[u>i(f] 1/V 7 ^)] i n Theorem 8 might be improved to the order 0[u>i(f] ln(n)/n)]. 
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3) It is interesting to note that the same estimate could be obtained for the 
truncated version of the Favard-Szdsz-Mirakjan operator 

n 

F ( n M \f)(x) = *=°^ ,xe [0,oo),n e N. 

V 



(nx) k 
k=0 



5 Applications 



First it is worth noting that from computational point of view, the nonlinear 
operators in the previous sections require a less amount of computation than 
their linear counterpart, since the computation of "max" is faster than that 
of the "sum". Also, if we take into account that the maximal term in the 
denominator is found explicitly in the proofs of the proposed theorems, then 
this does not require an extra loop for computation. 

In what follows, we illustrate by two concrete examples another possible 
usefulness of the proposed nonlinear operators. 

Thus, let us consider the functions /i, /2 : [0, 1] — > K+ 



h{x) 

and 



1 



•0.1 



r i if o < x < 0.4 

h{x)=\ IO.t-3 if 0.4<x<0.5 . 
[ 2 if 0.5 < x < 1 

In Figs. 1 and 2 we compare the linear and nonlinear Bernstein and Favard- 
Szasz-Mirakjan operators approximating the function f\. 

We observe that the nonlinear operators of either Bernstein or Favard-Szasz- 
Mirakjan type are able to better follow the peaks of the original function. 

In Figs. 3 and 4 we compare the linear and nonlinear Bernstein and Favard- 
Szasz-Mirakjan operators associated with the function fa. 

Remark 10 It is clear that in contrast to their linear counterparts, due to 
the "max" operator, these nonlinear operators in general do not preserve the 
smoothness of the function f{x), that is, even if f has continuous derivative, 
Bn (f)(x) and Fn (f)(x) fail to be differ entiable at some points. 

However, as the graphs in the case of the function fz(x) show, for the approx- 
imation of functions which are not differentiable at some points, these operators 
seem to be more suitable than their linear counterparts. 
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0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Figure 1: Comparison of Bernstein operators. Solid line: /i, dotted line: linear 
case, dashed line: nonlinear case. 




0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 



Figure 2: Comparison of Favard-Szasz-Mirakjan operators. Solid line: / 1; dot- 
ted line: linear case, dashed line: nonlinear case. 
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1 1 
i I 

1 

i I :' 

i / / 

' /■*" 

i /■' 

' i 

i j 
i .7 
1.7 

/ / 

■I j 

■*"' / 

•'" ' / 
•'"' ' / 

' / 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Figure 3: Comparison of Bernstein operators. Solid line: j^-, dotted line: linear 
case, dashed line: nonlinear case. 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



Figure 4: Comparison of Favard-Szasz-Mirakjan operators. Solid line: /2, dot- 
ted line: linear case, dashed line: nonlinear case. 
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Abstract. Some problems of finding exact values of the errors j 4<t(/)l(k) of 
best approximation by entire functions of exponential type in integral metrics 
are discussed. In particular, we prove generalized Markov- and Nagy-typc 
theorems and apply them to find A„-(f) L (^\ for some even and odd functions, 
such as |x| A , sgn(x) |x| , and x n log |x|. 

KEY WORDS: Best approximation, entire functions of exponential type. 



1. Introduction 

In this paper we discuss some problems of finding exact values of the errors of 
best approximation by entire functions of exponential type in the integral metric. 

Let L(R) be the Banach space of all functions / with the finite norm ||/||l(r) : = 
Jr \f{x)dx and let B a be the class of all entire functions of exponential type a > 
0. We define the error of best approximation by functions from B a to a locally 
integrable function / : R — > R by 

4r(/)z,(R) := mf ||/ - 3<t||l(k)- 

The Fourier transform of a function or a tempered distribution / is denoted by 
JF(/); similarly, the cos-transform of an even function or a tempered distribution 
/ is denoted by F c {f) an d the sin-transform of an odd function or a tempered 
distribution / is denoted by F s (f). In particular for / <G L(M), 



T c (f){t) := / f(x) costxdx, T s (f)(t) := / f(x) s'mtxdx, 

JR 

F(f)(t) := / f(x) cxp(itx)dx. 



Study of the problem of finding A a (f) L ng\ for some continuous functions / was 
initiated by M. Krein [13] (see also [1, Sec. 87]) in 1938 who proved the following 
result: 

Theorem 1.1. Let a continuous function f satisfy the inequality |/(x)| < C(l + 
x ) , x e R, and let there exist a number a G [0,7r/cr] and an entire function 
g a G B a n L(R) such that the following M a -condition holds: the product sin[er(x — 
a)](f(x) — g<y{x)) does not change its sign for all x G R. Then g a is an entire 
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function of best approximation to f in L(M) and 



Mf)i 



f{x) sgnsin[cr(a; — a)]dx 

£ 2^lM^ M(2fc+1)CT H/)((2fc + 1)<t)) 



fe=0 



Note that the M CT -condition of Theorem 1.1 means that g a interpolates / at 
equidistant points of M. and the difference / — g a changes its sign at these and only 
these points. We also remark that a cannot be chosen arbitrarily in Theorem 1.1 
since a is a solution of the equation 



J2 (-l) k f(a + k7r/a)=0. 



fc= — oo 

In 1939 Nagy [16] (see also [1, Sec. 88]) found classes of odd and even functions 
on K that satisfy the M CT -condition. 

Theorem 1.2. (a) Let f be an even function from L(R) such that its cos-transform 
T c (f) is twice differentiable on R, thrice differentiable on [a, oo), and for t> a, 

Then f satisfies M a -condition with a — ir /(2a) and 

MfhoQ - - 2^(-i) ^ T1 • (i.i) 

fe=0 

(b) Let f be an odd function from L(R) such that its sin-transform !F s (f) is twice 
differentiable on IR and for t > a, 

W)M>o, ^^<o. ^P>o. 

Then f satisfies M a -condition with a = and 

_4^ ^(/)((2fc+l)<7) 

fe=0 

Theorems 1.1 and 1.2 have been used in approximation theory for finding exact 
constants of approximation on convolution classes (see [1, 20, 12]). 

Since 1985 Vaaler and his students have published several papers on best approx- 
imation to some locally integrable functions. Their research has been influenced 
by applications of these results to some problems of number theory. In particular, 
Vaaler [21, Th. 4] established the relation 

A r7 (sgnx) L ( R )=ir/a. (1.3) 

Littmann [14, Th. 6.2] generalized this result by proving the relation 

in! ^ (_i)fcr 



A a (sgn(x) x n ) m = ---^ ]T I ' , n = 0, 1,... (1.4) 

k=0 y ' 

Since J2T=o( 2k + l )~ 2 = * 2 / 8 > ( L3 ) follows from (1.4). 
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In recent paper [4, Sec. 1] Carneiro and Vaaler established the following results: 
8r(A + l)sin(^A/2)^ (_i)«= 



4t(M A )l(R) 



^ CT (log|a;|) L( 



r A+2 



4 oo^ ^_^ fc 

^^ o (2feTIF- 



E 

fc=0 



(2fc + l) A + 2 ' 



|A|<1, (1.5) 



(1.6) 



Note that Theorems 1.1 and 1.2 cannot be applied to this functions since all of 
them do not belong to L(R). 

In this paper we extend Theorems 1.1 and 1.2 to locally integrable functions on 
W. and show that relations (1.3) through (1.6) are easy corollaries of these results. 
Other examples are discussed as well. 

2. Krein- and Nagy-type Theorems for Locally Integrable Functions 

We first discuss a more general version of Theorem 1.1. 

Theorem 2.1. Let f be a locally integrable function on M. and let there exist a 
number a e [0,7r/cr] and an entire function g a € B a such that f — g a G L(R) and 
the following M* -condition holds: the product sin[<r(x — a)](/(x) — g a (x)) does not 
change its sign for a. a. x G BL Then g a is an entire function of best approximation 
to f in L(M) and 



Mfh 



(f(x) — ga-(x)) sgnsin[cr(a; — a)]dx 



Proof. If g G B a n L(M), then T{g)(t) is continuous on R. Therefore, by the Palcy- 
Wiener theorem [19, Th. 7.2.1], !F(g)(t) = for \t\ > a. Next using properties of 
Fourier sums of functions of bounded variation over (0,27r) (see [22, Th. 2.8.1] and 
[1, Sec. 53]), we have 

4 ^ sin[(2fc + l)q(x-a)] 

- km > =sgnsino-(x-a) , x G 

7f n— >oo z — * m.i 



fe=0 



2fc+l 



and 



sup sup 

xGB. n>0 



4 -A sin[(2fc+l)cr(x-a)] 



k=0 



< 00. 



2fc + l 
Then by the Lebesgue dominated convergence theorem, for any g G L(M) we have 

\f(x) - 9a{x) - g(x)\dx > 

4 



lim 

n — >oo 7T 



(f(x) - g a (x) - g(x))sgnsin[a(x - a)}dx 
(f(x) - g A x) - ,(*)) ± M(2k + lM x - a)] dx 



fe=0 



lim '- [(m-9Ax))£ Sin[{2k t 1)a{X - a)] d:r 

•1 — >nd IT I™. ' ' 



k=0 



2fc+l 



(f(x) — g a (x)) sgnsin[cr(a; — a)]dx 
i 

This proves Theorem 2.1. 



\.f( x ) - 9a{x)\dx. 



D 
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Remark 2.1. Note that Theorem 2.1 strengthens Theorem 1.1 even in the case 
/ £ L(R) since we replace a condition |/(x)| < C(l + x 2 )^ 1 by a weaker condition 
/ £ L(R). In addition, note that the proof of Theorem 2.1 is similar to that of 
Theorem 1.1 (cf. [1, Sec. 87]). We remark also that a different version of Theorem 
2.1 was proved earlier by the author [5]. 

Next we prove a Nagy-type theorem for locally integrablc functions. 

Theorem 2.2. (a) Let f be an even locally integrable function on R, which is a 
tempered distribution, and let for some a > the restriction to (—oo, — <To)U(c , oo) 
of the cos-transform T c (f) of the tempered distribution f be a thrice differ entiable 
function. If J~ c {f) satisfies the following conditions for t > <7o-' 

W)M>o, <™<o, ^™>o, ^™>o. (2.1) 

and 

lim jF c (/)(i)=o, (2.2) 

£ — >-oo 

iften / satisfies M* -condition with a — it /(2a) and a > tr . In addition, 

DC 



, m 4^ fc ^ c (/)((2fc + l)q) 

^(/) i (R) = -2.(- 1 ) 2TTT 

fe=0 



^ Let f be an odd locally integrable function on M., which is a tempered distribution, 
and let for some <j$ > the restriction to (—00, — Co) U (<jq, 00) of the sin-transform 
Fsif) of the tempered distribution f be a twice differentiable function. If J- S {f) 
satisfies the following conditions for t > o~q: 

F M t»0, «<0, «>>0, (2.3) 

and 

lim T s {f){t) = 0, (2.4) 

t — ► OO 

then f satisfies M* -condition with a = and a > a®. In addition, 
A m A^ T s {f){{2k + l)o) 

k=0 

Proof, (a) Let a > erg- We first find a function h a £ B a such that / — h a £ L(R). 
Setting r :— (co + cr )/2, we extend !F c (f)(t) from (—00, — r] U [t, 00) to K by the 
formula 

™ - { Sir U! ; ;, <") 

where 

. . tT « - jM 4 -rjr(2) + 3 jr(i) 2 T 2^(2) _ 5r jr(i) + 8J r(o) 

*M*):= ^ * + ^ f + § 



is the Hermite polynomial satisfying the relations 

s = 0,1,2. 



pj'\±r) = ^H±r):=^ [ % m] 



t = ±T 
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Then F is an even and bounded function on R. Moreover, it is a twice diffcrcntiablc 
function on M. and a thrice diffcrcntiablc function on [r, oo), which satisfies the 
conditions 

F(t) > 0, F'(t) < 0, F"(t) > 0, F'"(t) < 0, t > t, (2.6) 

and 

lim F(i) = 0. (2.7) 

£ — *oo 

Next, it follows from (2.5) and (2.6) that F' E L{R) and F" E L(R). Hence 
integrating by parts and taking account of (2.7), we have for x ^ 

,-A 



T c (F)(x)= lim / F(t)cosxtdt 



I F(i) sinxt 

lim ' 



A 
A -. -A \ 

/ F'(t) sin xtdt 

A^oo \ X „ X 

= -- F'(t)smxtdt= -J / F"(t) cos xtdt. 

x JWL X JWL 

Hence T c (F)(x) exists for every x ^ and 

\F c (F)(x)\<Cx- 2 , x^0. (2.8) 

Further setting ip(x) := (2n)^ 1 T c (F)(x) , we shall show that h a := f — tp belongs to 
B a . Indeed, if is a tempered distribution since it is the cos-transform of a bounded 
continuous function (27r) -1 F(£) on R. Hence 

?c(<p) - F. (2.9) 

Then h a is an even tempered distribution, and its cos-transform Tdjia) is defined 
as the functional (h a ,!F c (Tpy), where ip is an even rapidly decreasing function from 
the Schwartz class S. Therefore, if ip E S and ip = on [— n, ri], where n G (r, a), 
then by (2.5) and (2.9), 



(A f /cW) = (//cW)-(v/cW)= / (r c tf){t)-F(t))ii>{t)dt = Q. 

J\t\> Tl 

Thus the support of !F c (h a ) is a subset of [—a, ct]. Using now the generalized Palcy- 
Wiener theorem [19, Th. 7.2.3], we arrive at h a E B„. Next, 

f(x) - h a {x) = (2tt)- 1 T c (F)(x), (2.10) 

so (2.8) and (2.10) imply 



/ 



\f{x)-h a {x)\dx < I \f(x)\dx+ \K){x)\dx 

\x\<l J\x\<l 

|/(x) — h a (x)\ dx < oo. 

|x|>l 



Thus tp = f - ha- E L(R). Moreover, since <p E L(R), identity (2.9) holds not 
only in the distributional sense but also as a formula for the cos-transform of an 
integrable function. Hence taking account of (2.5), (2.6), and (2.9), we conclude 
that tp satisfies all the conditions of Theorem 1.2(a). Therefore, tp satisfies M CT - 
condition for a = it/(2<j), that is, there exists G a E B a n L(R) such that the 
function coscra;[/(x) — h a (x) — G a (x)] does not change its sign on R. Therefore, 
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we conclude that / satisfies M*-condition for a = ir/(2a) and g a := h a + G a . 
Moreover, by (1.1), (2.5), and (2.9), 

A m A (f h\ 4 Vr , Mf-h a ){{2k + l)a) 
A a {}) L (R) = Ar(/ -n a ) L (w) = - 2_^\ l > 2 fc + 1 

fc=0 

4 ~ fe ^(/)((2fc + l)a) 

7T^ 1 j 2fc+l 

fc=0 

This proves statement (a) of Theorem 2.2. 

(b) The proof of this statement is similar to that of Theorem 2.2(a) if we replace 
^c(f) by T s {f) and Pi{t) by the polynomial 

t 2 ^( 2 ) - 3r^« + 3JF(0) _ T 2^(2) + 5r ^(i) _ 5 ^(o) 

5W 8r 5 4t 3 

t 2 ^( 2 ) - 7r^« + 15^(°) 

+ 87 *■ 

Therefore, Theorem 2.2 is established. □ 

Remark 2.2. Note that Theorem 2.2 strengthens Theorem 1.2 even in the case 
/ G L(R) since we drop the conditions that T c (f) and ? r s (f) should be twice 
diffcrcntiable on R. We remark also that a special case of Theorem 2.2 for <7o = 
was proved earlier by the author [5]. 

3. Examples 
Example 3.1. /„ sgn (a:) '■= sgn(x) x n , n = 0, 1, 2 . . . 
Then the Fourier transforms of / n , S gn are given by the formulas [3, Sec. 10.1, #6] 

^c(/ n , sg „)W = 2n!(-l)("+ 1 )/ 2 t-("+ 1 ), t>0, n = l, 3, 5,..., 
^(/n, sg n)(t)-2n!(-l)"/ 2 t-(" +1 ), i>0, n = 0,2,4,... (3.1) 

Since for odd n > 1, (— 1)(™ +1 )/ 2 / n]Sgn satisfies all the conditions of Theorem 2.2(a) 
and for even n > 0, (— l)™/ 2 /„ iSgn satisfies all the conditions of Theorem 2.2(b), we 
arrive at (1.4). 

Example 3.2. f x {x) := \x\ x , A e (-1, oo), A # 0, 2, 4, . . . 

Then by [3, Sec. 10.1, #11], 

^"c(/A)(t) = -2sin(A7r/2)r(A + l)t- (A+1) , t > 0. 

Since — sin(A7r/2)/A satisfies the conditions of Theorem 2.2(a), we obtain 

A(f , _ 8|sin(A7r/2)|r(A + l) ^ (-l) fc 



fe=0 



(2fc+l) A + 2 ' 



Remark 3.1. The first direct proof of this result was given in [5]. Equality (3.2) 
can be also obtained as a corollary of a limit theorem for polynomial approximations 
in the integral metric [18] and i-approximation asymptotic results by Nikolskii [17] 
and Bernstein [2] (see [6, 7, 15, 10] for more details). A different proof of (3.2) for 
|A| < 1 was given in [4]. 
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Example 3.3. f\, sgn (x) ■= sgn(x) \x\ x , A e (-1, oo), A ^ 1, 3, 5, . . . 
Then by [3, Sec. 10.1, #12], 

WA, S gn)(i) = 2cos(A7r/2)r(A+ l)t^ x+1 \ t > 0. 
Since cos(Xw/2)f\ satisfies the conditions of Theorem 2.2(b), we obtain 
A(f , _ 8|cos(A7r/2)|r(A + l) ^ 1 

Ax(/A,sgn)L(R) - ~^+l 1^ ( 2 fc + l)A+2- ^.3) 

Example 3.4. /„,i og (a;) := x"log |x|, n = 0, 1,2... 
Then by [3, Sec. 10.1, #s 22,23], 

Wn,iog)(t) - n(-l) n / 2 nlt-^ +l \ t>a o >0, n = 0, 2, 4, . . . 
Mfn,io S )(t) = 7r(-l)(" +1 )/ 2 n!t-(" +1 ), t>a o >0, n = 1, 3, 5, . . . 

Using Theorem 2.2, we get 

4n! °° f— l) fe 

4,(/n,log)L(R) = ^TtE7^TtW ^ = ' 2 ' 4 ---- ( 3 - 4 ) 

4n! °° 1 

^<7(/n,log)z,(R) = ^Ti zZ (2k + 1)"+ 2 " = 1,3,5,... 



fc=0 



A different proof of (3.4) for n=0 was given in [4]. A two-dimensional version of 
(3.4) was established in [8]. 

Example 3.5. /| arc := (2/n) arctan(x//3), /? > 0. 

We first note that 

W£arc)W := 2t- 1 e-' 3 ', « > 0. 

Indeed, using (3.1) and [11, Sec. 4.57], we have 

WiVcX*) = ^(sgncc)(i) - (2/7r)^ s (arctana;)(t) = 2/t - (2/t)(l - e"*) - 2t- 1 e~ t . 

Since /J arc satisfies the conditions of Theorem 2.2(b), we obtain 



8 °° 1 

4.(/£arc)i(R) = — g (2fc+1) 2 £ 



(2fc + l) 2 e^( 2fe + 1 )' 
Note that for all x G R, lim / g_,o+ /a a rc( x ) = s S nx an d 

li + m + ^<r(/,3 : arc)L(R) = At (sgn x) L (R) = 7r/er. 

Remark 3.2. Similarly to Examples 3.1-3.5, we can find j4 ct (/)l(r) f° r functions 
such as sgn(x)x™log |x|, |x| A log \x\, sgn(x) |x| A log \x\, A # 0, 1,2,..., as well as 
more general functions of the form |x| A log |x| and sgn|x| A log \x\. Note that 
some of these results were recently discussed in [9, pp. 94, 95]. 
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Abstract 

If p(z) is a polynomial of degree n and p'(z) its derivative, then it 
is well known that 

max |p'(z)| < n max |p(z)|. 
W=i M=i 

Also, for an entire function /(z) of exponential type r, it was proved 
by S. N. Bernstein that 

sup |/'(x)|<t sup |/(x)|. 

— oo<x<oo — oo<:r<oo 

Both the above inequalities are sharp, and have been the starting point 
of a considerable literature in Approximation Theory. In this paper we 
will discuss some of the research centered around above inequalities for 
some classes of polynomials, including self-reciprocal polynomials, and 
the extension of some of these results for entire functions of exponential 
type, and for uniformly almost periodic functions. 

Key Words: Self-reciprocal polynomials, Entire functions of exponential 
type, Uniformly almost periodic functions. 
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1 Introduction 



Some years after the chemist Mendeleieff invented the periodic table of the 
elements he made a study of the specific gravity of a solution as a function 
of the percentage of the dissolved substance [38] . This function is of some 
practical importance: for example it is used in testing beer and wine for 
alcoholic content, and in testing the cooling system of an automobile for 
concentration of anti-freeze; but present-day physical chemists do not seem 
to find it as interesting as Mendeleieff did. Nevertheless Mendeleieff 's study 
led to mathematical problems of great interest, some of which are even today 
inspiring research in Mathematics. 

An example of the kind of curve that Mendeleieff obtained was specific 
gravity of a solution in terms of the percentage of alcohol. He noticed that 
such curves could be closely approximated by successions of quadratic arcs 
and he wanted to know whether the corners where the arcs joined were really 
there, or just caused by errors of measurement. In mathematical terms, this 
amounts to considering a quadratic polynomial P(x) = px 2 + qx + r with 
|-P(x)| < 1 for — 1 < x < 1, and estimating how large can |P'(x)| be on —1 < 
x < 1 (for details, how the Mendeleieff's problem in Chemistry amounts 
to this mathematical problem in polynomials, see Boas [7]). Surprisingly, 
Mendeleieff himself was able to solve this mathematical problem and prove 
that |P'(x)| < 4; and this inequality is best possible, since when P{x) = 
1 — 2x 2 we have |-P(x)| < 1 for — 1 < x < 1 and |P'(±1)| = 4. By using this 
result Mendeleieff was able to convince himself that the corners in his curve 
were genuine; and he was right, since his measurements were quite accurate, 
as they agree with modern tables to three or more significant figures. 

Mendeleieff told his result to a Russian mathematician A. A. Markoff, 
who investigated the corresponding problem in a more general setup, that 
is, for polynomials of arbitrary degree n. He [37] proved the following result 
which is known as Markoff 's Theorem. 

THEOREM 1.1 If p(x) = ^™_ a>vX u is a real polynomial of degree n and 
\p(x)\ < 1 on [—1,1] then 

\p'(x)\ <n 2 for - 1 < x < 1. (1.1) 

The inequality is best possible and is attained at only x = ±1 only when 
p{x) = ±T n (x), where T n (x) (the so called Tchebycheff polynomial of the 
first kind) is cos(ncos _1 x) (which actually is a polynomial, since 
cos n6 is a polynomial in cos 6). In fact T n (x) = cos(ncos _1 x) = 
2 n " 1 n™ =1 {x - cos((^ - \)-n/n)}. 
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It was several years later, around 1926, when a Russian mathematician 
Serge Bernstein needed the analogue of Theorem 1.1 for the unit disc in the 
complex plane instead of the interval [—1,1]. He wanted to know if p{z) 
is a polynomial of degree at most n (by a polynomial of degree at most n 
we mean an expression of the form Ylv=o a v zU ' a v being complex and z a 
complex variable) with \p(z)\ < 1 for \z\ < 1, then what is maximum of 
\p'(z)\ for |*| < 1? 

The answer to this is given by the following which is known as Bernstein's 
inequality [4]. 

THEOREM 1.2 Ifp(z) = J2u=o a ^ zU ^ a polynomial of degree at most n, 
then 

max \p (z)\ < nmax |p(z)|. (1-2) 

\z\<l |z|<l 

The result is best possible and the equality holds for p{z) = \z n , A being a 
complex number. 

The above Bernstein's inequality has an analogue for trigonometric poly- 
nomials which states that if t(9) = Xl™=-n a v& %ve is a trigonometric polyno- 
mial (possibly with complex coefficients) of degree n, \t(9)\ < 1 for < 9 < 
2vr then for < 9 < 2vr, 

\t'{9)\<n. (1.3) 

In (1.3) equality holds if and only if t{9) = e* 7 cos (n6 — a), where 7 and a 
are arbitrary real numbers. 

Note that a trigonometic polynomial t(9) = Ylu=-n c u& lud (possibly with 
complex coefficients) is said to be real if c v = c- v . 

Inequality (1.3) is also known as Bernstein's inequality although Bern- 
stein [4] proved (1.3) with 2n in place of n. His proof was based on a varia- 
tional method. Inequality (1.3) in the present form first appeared in print in 
a paper of Fekete [19] who attributes the proof to Fejer [17]. Bernstein [3] 
attributes the proof to E. Landau (see Schaeffer [51] and Fejer [18] ). Al- 
ternative proofs of the inequality (1.3) have been supplied by F. Riesz [47], 
M. Riesz [48], de la Valee Poussin [55], Rogosinski [49] and others, and each 
of these methods has led to interesting extensions of the inequality (1.3). 

If p{z) = 2~X'=o a v zV i s a polynomial of degree at most n, then t(9) = 
pie ) is a trigonometric polynomial of degree n with \t(9)\ < 1, real 9, hence 
applying (1.3) to t{9) = pie ) one can get Theorem 1.2. 

Bernstein needed the above inequalities in order to answer the following 
question of best approximation raised by de la Valee Poussin in the early 
part of last century; Is it possible to approximate every polygonal line by 
polynomials of degree n with an error of o(l/n) as n becomes large? (The 
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result that the approximation can be carried out with an error of 0(l/n) 
was proved by de la Valee Poussin himself). This problem has played an im- 
portant role in the development of the theory of approximation and was an- 
swered in the negative by S. Bernstein [4]. Several monographs and papers 
have been published in this area (see Boas [7], Durand [15], Govil and Mo- 
hapatra [30], Rahman and Schmeisser [42], Rassias [46], Telyakovskii [53], 
and Voronovskaja [56]). 

The inequality (1.2) and the corresponding inequality for entire functions 
of exponential type (which was proved by Bernstein himself) have been the 
starting point of a considerable literature in approximation theory and in 
this paper we study some of the research centered around these inequalities. 
This paper consists of three sections, including Section 1, which is an intro- 
duction. In Section 2, we discuss some inequalities analogous to (1.2) for 
some classes of polynomials including self-reciprocal polynomials, while in 
Section 3 we deal with analogous inequalities for entire functions of expo- 
nential type for some of the results in Section 2, and a result for uniformly 
almost periodic functions. 



2 Polynomials 

We begin with Bernstein's inequality (1.2). In view of the maximum mod- 
ulus principle, it states equivalently that for a polynomial p{z) of degree at 
most n, 

max|p'(z)| < nmax|p(z)|. (2-1) 

1*1=1 1*1=1 

The above inequality is best possible with equality holding for p{z) = Xz 11 , 
for every complex number A. It was conjectured by Erdos [16] that if a poly- 
nomial p{z) has no zeros in \z\ < 1, then maxu =1 |p'(£)| < 
(n/2) maxi 2 i =1 |p(z)|. This conjecture was proved in the special case when 
p{z) has all its zeros on \z\ = 1 independently by Szego and Polya (see 
Lax [35]). In the general case the conjecture was established for the first 
time by Lax [35], who proved 

THEOREM 2.1 If p{z) is a polynomial of degree n, p{z) ^ for \z\ < 1, 

then 

n 
max\p'(z)\ < -max|p(z)|. (2.2) 

1*1=1 l |*|=i 

The result is best possible and the equality in (2.2) holds for any polynomial 
which has all its zeros on \z\ = 1. 
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Simpler proofs of this result were given by de Bruijn [13] and Aziz and 
Mohammed [2]. For some generalizations of Theorem 2.1, see Boas [8], 
Gardner and Govil [23] and Rahman [40]. 

Professor R. P. Boas, Jr. raised the question that "How large can the 
bound in (2.1) be if p(z) has k zeros on or outside the unit circle?" In this 
connection, Giroux and Rahman [33] proved 

THEOREM 2.2 For every positive integer n, there exists a polynomial 
p{z) of degree n having a zero on \z\ = 1, such that 

maxlp (z)\ > (n — c/n) max |p(z)|. (2.3) 

|z| = l \z\=l 

On the other hand for an arbitrary polynomial p{z) of degree n having a 
zero on \z\ = 1, they proved 

max|p'(z)| < (n )max|p(z)|. (2.4) 

\z\ = l 47T7J |z| = l 

Also, it was proposed by R. P. Boas, Jr. to obtain inequalities analogous to 
(2.1) for polynomials having no zeros in \z\ < K, K > and the following 
partial result in this connection was proved by Malik [36]. 

THEOREM 2.3 If p{z) is a polynomial of degree at most n having no 
zeros in \z\ < K , K > 1, then 

max|p'(z)| < (— — ) max \p(z) | . (2.5) 

|z| = l 1 + A \z\=l 

The result is best possible with equality holding for p{z) = (z + K) n . 

Govil and Rahman [31] generalized Theorem 2.3 of Malik [36] for any 
order derivative of the polynomial p{z) and proved 

THEOREM 2.4 If p{z) is a polynomial of degree at most n, having no 
zeros in \z\ < K, K > 1, then 

, t^, m n(n — 1) ■ ■ ■ (n — s + 1) . . , , . 

max p {s > (z) < -i '- i '- max p(z) . (2.6) 

\z\=i 11 y M ~ 1 + K S \z\=l 

For s = 1, (2.6) obviously reduces to (2.5). 

Another generalization of (2.2) was later given by Chan and Malik [10], 
who proved 

THEOREM 2.5 If p{z) = ao + J2v=u a v zl/ ^ s a polynomial of degree at 
most n, p{z) ^ in \z\ < K, K > 1, then 

max\p'(z)\ < -—m&-K\p(z)\. (2.7) 

|z|=l^ V Jl - l + £> \z\=l lFy M ^ ' 
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The equality in (2.7) is attained for p{z) = (z^ + K^) n '^, n being a multiple 
offi. 

The inequality (2.7) in the case \x = 2 can also be found in Govil [24]. 

For some time it was believed that if p{z) ^ in \z\ < K, K < 1, then 
the inequality analogous to (2.5) should be 

n 

max\p'(z)\< -— max|p(z)|, (2.8) 

\ z \=i ' V Jl ~ 1 + K n \z\=i ] y 

till E. B. Saff gave the example p(z) = [z — \){z+ |) to counter this belief. 
As can be easily verified for this polynomial, the left hand side of (2.8) is 
approximately 2.1666 while the right hand side is j — (1 2 maxi 2 i =1 \p(z)\ ~ 

2.144 < 2.166 and so (2.8) cannot hold true. 

Thus the problem in the case p{z) ^ for \z\ < K, K < 1 is still open. 
However, for polynomials having all their zeros in \z\ < K, the problem is 
solved for all K > 0, and following result in this direction is due to Govil [25] . 

THEOREM 2.6 If p{z) is a polynomial of degree n having all its zeros in 
\z\ < K, then 



max In (z)\ > < 
1*1=1 



n 

max|p(z)| if K < 1 

1 + K \z\=i 1 ^ n 

Tl 

max|r>(z)| if K > 1 



Both the above inequalities are best possible. 

The case K < 1 of the above theorem was also proved by Malik [36], 
and the case K = 1 by Turan [54] . A simpler proof of the above result in 
case K > 1 was provided by Datt [11]. 

If p(z) is a polynomial of degree n, it is obviously of interest to obtain 
an inequality analogous to Bernstein's inequality for polynomials satisfying 
p(z) = z n p(l/z) or p(z) = z n p(l/z). 



The polynomials satisfying p(z) = z n p(l/z) are known as self-conjugate 
polynomials, and for this class of polynomials the following result (see Govil 
[25], O'Hara and Rodriguez [39] and, Saff and Sheil-Small [50]) is well known. 

THEOREM 2.7 // p{z) is a polynomial of degree n satisfying p{z) = 
z n p{l/z), then 

Tl 

max|p'(z)| = -max|p(z)|. (2.9) 

kl=i z M=i 

Let n n denote the class of polynomials of degree n satisfying p{z) = 
z n p{\/ z). This class is interesting because for any polynomial p{z) of degree 
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n, the polynomial P(z) = z n p{z + 1/z) is always in Ii-2 n . A polynomial 
belonging to the class IT„ is known as a self-reciprocal polynomial. 

It was proposed by Professor Q. I. Rahman to obtain inequality analo- 
gous to Bernstein's inequality (2.1) for polynomials belonging to Il n , and in 
an attempt to answer this question Govil, Jain and Labelle [29] proved the 
following partial result. 

THEOREM 2.8 If p(z) is a polynomial belonging to Ii n and having all 
its zeros in the left half-plane or in the right half-plane, then 

max \p'{z) | < — = max \p(z) \ . (2.10) 

M=i V2 M=i 

It is not known if (2.10) is best possible, however by considering p(z) = 
z n + 2iz n ' 2 + 1, n being even, they showed that if p(z) simply belongs to H n , 
then the bound in (2.10) can not in general be smaller than nj\2. 

In fact Ruscheweyh considered the polynomial p(z) = {l+iz) 2 -\-z n ~ 2 (z-\- 
i) 2 . Note that this polynomial belongs to H n and on \z\ = 1, \p(z)\ < 
|1 + iz\ 2 + \z + i\ 2 = | - i + z\ 2 + \z + i\ 2 = 4, while \p'{i)\ = 4(n - 1) 
and so maxui =1 |p'(z)| > 4(n — 1). Thus maxi 2 i =1 |p'(z)|/max| 2 | =1 \p(z)\ > 
(4(n — l))/4 = (n — 1) > n/v2, implying that if p{z) only belongs to Il n , 
the bound in (2.10) should be something greater than n/y/2. 

Aziz [1] considered another subclass of Il ra and proved 

THEOREM 2.9 Let p{z) = E"=oK + i0v)z v , &u > 0, U > 0, v = 

0, 1, 2, . . . n be a polynomial belonging to II„. Then 

Tl 

max\v'(z)\ < —=m&x\p(z)\. (2.11) 

|*|=i V2\z\=i 

The equality in (2.11) again holds for the polynomial p(z) = z n -\-2iz n ' 2 + 1, 
n being even. 

As is easy to observe, the hypothesis of Theorem 2.9 is equivalent to 
that p(z) belongs to Il n and that all the coefficients of p(z) = Yl™=o avZ/V 
lie in the first quadrant of the complex plane. In fact, if all the coef- 
ficients of a polynomial p(z) belonging to II ra lie in a sector of opening 
it/2, say in, tp < arg z < ip + vr/2, for some real ip, then the polynomial 
P(z) = e~ l ^p(z) belongs to II n and has all its coefficients lying in the first 
quadrant of the complex plane. Since maxui =1 |P(;z)| = maxui =1 \p(z)\ and 
maxi 2 i =1 |P'(-z)| = maxui =1 |p'(z)| we may apply Theorem 2.9 to P(z) to get 
that if p{z) £ Il n and has all its coefficients lying in a sector of opening at 
most 7r/2, then also (2.11) holds. The following result that is equivalent to 
this statement appears in Jain [34]. 
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THEOREM 2.10 Letp(z) = E"=o a ^ where a u = a v e i(t> + (3 u e^ , a v > 0, 

(3 U > 0, 1,2, ..., n, < \(f) — ip\ < it/2, be a polynomial of degree n. If further 

p{z) € II n , then 

n 
m&x\p'(z)\ < —=max\p(z)\. 

\z\=i v^ W=i 

The result is best possible with equality holding for the polynomial p{z) = 
z n + 2iz n ' 2 + 1, n being an even integer. 

Later, Datt and Govil [12] proved the following result which can yield 
both the above results Theorem 2.9 and Theorem 2.10. 

THEOREM 2.11 Let p{z) = YZ=o( a v + iPv)z v be a polynomial of de- 
gree n belonging to LT n . If on \z\ = 1, the maximum of \ s ^]1, = qOi v z v \ and 
I Ylu=o fiv zV \ i- s attained at the same point, then 

n 
max|j/(z)| < -^=max|p(z)|. 

\z\=i V2 |z|=i 

The equality here holds again for p{z) = z n + 2iz n < 2 + 1, n being an even 
integer. 

Govil and Vetterlein [32] obtained a bound for maxu =1 |p'(-z)| which 
depends on the opening of the sector containing all the coefficients of a self- 
reciprocal polynomial and includes as special cases Theorem 2.9 and 2.10. 
Further, their result is applicable even when the opening of the sector is 
greater than ir/2. More precisely, their result is 

THEOREM 2.12 Let p{z) = s ^I = qO ju z v is a polynomial belonging to Ii n , 
with its coefficients lying in a sector of opening 7 with vertex at the origin 
where < 7 < 2ir/3, then 

n 

m&x\p'(z)\ < ; — — max |p(z)|. 

| 2 |=i l/n ;| -2cos(7/2)| 2 |=i'^ n 

The result is best possible with equality holding for the polynomial p{z) = 
z n + 2iz n ' 2 + 1, n being an even integer. 

Rahman and Tariq [44] observed that in the above theorem a sharp esti- 
mate of maxu =1 |p'(^)|, that is valid for 7 in [0,7r) instead of [0,27r/3], can 
be given in terms of \p(l)\ and that the proof given by Govil and Vetterlein 
[32] can be easily modified to prove the following 

THEOREM 2.13 Let p(z) = J2u=o a v zU be a polynomial belonging to Il n , 
with its coefficients lying in a sector of opening 7 with vertex at the origin 
where < 7 < it, then 

max|j/(s)|<- r7 ^|p(l)|. (2.12) 

|z|=i 2cos(7/2j 
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In the case where n is even, the polynomial p(z) := z" + 2e 17 z n ' 2 + 1 shows 
that the above inequality is sharp for any 7 E [0,7r). 

Although the class H n of polynomials has been extensively studied among 
others by Frappier and Rahman [20] and Frappier, Rahman and Ruscheweyh 
[21], the problem of obtaining a sharp inequality analogous to Bernstein's 
inequality (2.1) is still open for polynomials of degree n > 3. However, the 
following sharp inequality in the reverse direction, which is easy to obtain, 
is due to Dewan and Govil [14]. 

THEOREM 2.14 If p{z) is a polynomial belonging to II ra , then 

max|p(2;)| > — max|p(z)|. (2-13) 

|z|=l 2 |z|=l 

The result is best possible and the equality holds forp(z) = (z n + 1). 

It may be remarked that several of the above mentioned results for poly- 
nomials have been extended to entire functions of exponential type, and this, 
we take up in the next section. 



3 Entire functions of exponential type and uni- 
formly almost periodic functions 

In this section we will discuss the extension of some of the results on poly- 
nomials mentioned in Section 2 to entire functions of exponential type, and 
of Theorem 2.13 to entire functions of exponential type that are uniformly 
almost periodic on the real line. We start with the following definitions. 

Let / be an entire function and r be any positive real number. We will 
denote 

M(r) = Mf(r) :=max|/(z)|. 

\z\=r 

The order (or the order of growth) of an entire function /, denoted by p, is 
defined by 

log log Mir) 

p := limsup . 

rwoo log r 

It is a convention to take the order of a constant function of modulus less 
than or equal to one as 0. An entire function of finite order p is said to have 
type T, where T is given by 

rr, y log M(r) 

1 := limsup ■ 



r /> 
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DEFINITION 1 An entire function / is said to be of exponential type r 
if for every e > there is a constant k(e) such that |/(^)| < k(e) e( T+£ )\ z \ for 
all z£C. 

It is clear that entire functions of order less than 1 are of exponential 
type r, where r can be taken to be any number greater than or equal to 0. 
Also entire functions of order 1 and type T < r are of exponential type r. 

Examples of entire functions of exponential type includes polynomials 
with complex coefficients, sinrz, cosrz etc. 

DEFINITION 2 Let / be an entire function of exponential type. The 

function 

loel f(re w )\ 
h f (9) :=limsup-^^ !i i o<0 < 2tt. (3.1) 

r— +oo T 

is called the indicator function of /. It describes the growth of the function 
along a ray {z\ argz = 9}. It is finite or — oo. Unless hf{9) = — oo, it is a 
continuous function of 9. 

Bernstein himself (see [3], p. 102) found the extension of inequality (2.1) 
for the entire functions of exponential type that are bounded on the real 
line. He in fact proved 

THEOREM 3.1 Let f be an entire function of exponential type t, bounded 
on the real axis. Then 

sup \f(x)\ <r sup |/(x)|. (3.2) 

— 00<X<00 — 00<IE<00 

The estimate is sharp. 

Equality holds for f{z) = sinrz. In fact equality holds if and only if f{z) = 

ae iTZ + be~' 1TZ , where a, b G C and \a\ + |6| > 0. 

If p(z) := Y^l=o c " zV ^ s a polynomial of degree at most n with complex 
coefficients, then f{z) = p(e lz ) is an entire function of exponential type n. 
Furthermore, if we assume that p{z) has no zero in the open unit disk \z\ < 1 
then f{z) will have no zero in the open half plane Im(z) > 0. Also 

, /fl\ .. log|/(iy)| 
hf{—) := hmsup = 

2 j/-+oo y 

In the light of above and Theorem 2.1, the following result of Boas [8], see 
also [40], provides the generalization of Theorem 2.1 for entire functions of 
exponential type. 

THEOREM 3.2 Let f be an entire function of exponential type r, bounded 
on the real axis, f(z) / for y > 0, and hf(n/2) = 0. Then 

sup |/'(x)| < T - sup |/(x)|. (3.3) 

— oo<x<oo ^ — oo<x<oo 
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The inequality is sharp. 

The function f(x) = (l+e 1TZ )/2 serves as an extremal function for 
the inequality. We observe that |/(x)| = |(l + e 1TZ )/2| < 1 for all x £ R 
and /(0) = 1. So sup_ 00<:r<00 \f(x)\ = 1. f(x) = if and only if x = 
(2n + 1)tt/t, where n is any integer. Thus / has only real zeros. 

vr tog|/(iy)| log( 1±i F) n 
hf(-) = limsup = limsup = (J 

2 j/^oo y y^oo y 



and 



Thus 



iTe 7" 

l/'(z)l = l^ - 1 = 2 for a11 x - 



sup I /'(a;) I = - sup \f(x)\. 

-00<X<00 ^ — 00<IE<00 



It is worth mentioning that the condition fo/(§ ) = in the above theorem 
is necessary as it can be seen from the function f(x) = cos tz which is an 
entire function of exponential type r having all the zeros on the real axis. 
That is f(z) / for y > 0, sup_ 00<a . <00 \f(x)\ = 1, 



TV ,, log(° 



hf(-) = limsup = r > 

2 j/— »oo y 

and 

|/'(ar)|=r = r sup |/(»)| for all i£l. 

— oo<x<oo 

It was proved by Rahman [41] that the equality holds in (3.3) for any entire 
function of exponential type r > satisfying conditions of the Theorem 3.2 
whose zeros are all real. 

Govil and Jain [28] studied class of entire functions of exponential type 
r > satisfying f{z) = e 1TZ f(z). They proved the following theorem for this 
class. 

THEOREM 3.3 If f is an entire function of exponential type r such that 
f(z) =e iTZ J(¥j, then 

sup \f{x)\ = T - sup \f{x)\. (3.4) 

— oo<a;<oo ^ — oo<ir<oo 

The above result provides an extension of Theorem 2.7 for entire functions 
of exponential type. 
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It is clear that if zq is a zero of / that satisfies the condition f{z) = 
e 1TZ f(z), then so is zq. Thus upper half plane Im(z) > has as many zeros 
of / as the lower half plane Im(z) < has. 

To see, what to expect as an extension of self-reciprocal polynomials, 
consider f{z) = p(e lz ), where p{z) is a self-reciprocal polynomial of degree 
re. The function / is an entire function of exponential type n satisfying the 
condition 

f( z )=e inz f(-z). 

This led Govil [27] to consider a class of entire functions of exponential type 
r that satisfies the condition f(z) = e 1TZ f{—z). He proved several results 
for functions in this class. One of his result which provides an extension of 
Theorem 2.14 for entire functions of exponential type is 

THEOREM 3.4 Let f be an entire function of exponential type t, satis- 
fying the condition f{z) = e 1TZ f(—z). Then 

sup \f'(x)\ > T - sup |/(x)|. (3.5) 

— oo<x<oo ^ — oo<a;<oo 

The result is best possible and the equality holds for f(z) = (1 + e 1TZ ). 

He deduced Theorem 3.4 as a corollary of another inequality that he 
proved for the functions in this class. However, the result can also be proved 
directly, and the proof is not difficult. 

Let us denote the ratio sup_ 00<:E<00 \f'(x)\/sup_ 00<x<00 \f(x)\ by q(f). 
Theorem 3.4 gives us the smallest value of q{f) if / satisfies the condition 
f{z) = e 1TZ f(—z). From Theorem 3.1, we already know that, q{f) could 
be as large as r and it will be equal to r only when the function is of the 
form ae 1TZ + be~ 1TZ . However, it is easy to see that functions satisfying the 
condition f{z) = e 1TZ f{—z) can not be of this form. So, again from Theorem 
3.1, q{f) < t for such functions. Thus the question is: "How large q(f) 
could be for such functions" ? 

Rahman and Tariq [45] have shown through an example that it could 
be as close to r as one wish. In fact the following result holds true. 

THEOREM 3.5 Given any number e € (0 , r), we can find an entire func- 
tion f £ of exponential type r which satisfies the condition f £ {z) = e 1TZ / e (— z) 
such that 

SU P \fs( x )\ > ( r - £ ) S UP \fe(x)\- 
— oo<a;<oo — oo<ir<oo 

It is clear that if zq is a zero of / that satisfies the condition f{z) = 
e 1TZ f(—z), then so is —zq. Thus upper half plane Im{z) > has as many 
zeros of / as the lower half plane Im{z) < has. 
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In view of the above observation, this result is a bit surprising because 
functions satisfying condition f{z) = e 1TZ f(—z) are in some sense similar to 
those discussed in Theorem 3.3. They both have as many zeros in the upper 
half plane as in the lower half plane. 

Rahman and Tariq [45] formulated and proved an extension of the re- 
sult of Govil and Vetterlien [32] to the entire function of exponential type. 
Theorem 2.13 requires the coefficients of a self-reciprocal polynomial to lie 
in certain sector. The main issue while deciding about the extension to the 
entire function of exponential type was: " What class of entire functions of 
exponential type would admit an extension of Theorem 2.13"? 

If we simply take functions of the form f(z) = p(e lz ) = J2u=o a v & vz 
and require coefficients to lie in a sector, then it is indeed an entire function 
of exponential type but too restrictive as an arbitrary entire function of 
exponential type, in general, cannot be expressed as a finite or infinite sum 
of the form ^ a v e luz . According to Rahman and Tariq [45] an appropriate 
class of entire functions of exponential type for which the Theorem 2.13 will 
admit an extension is the one whose elements are uniformly almost periodic 
on the real line. Under certain conditions, functions that are uniformly 
almost periodic on the real line may be represented as a sum ^ a u e uZ . 
They gave the following extension of Theorem 2.13 for entire functions of 
exponential type. 

THEOREM 3.6 Let f be an entire function of exponential type r such 
that f(z) = e 1TZ f(—z). Furthermore, let f be uniformly almost periodic on 
the real axis, with Fourier series f(x) ~ ^2^Li^-n^ , where the coeffi- 
cients A\,A2,... lie in a sector of opening 7 G [0 , vr) with vertex at the 
origin. Then 

sup |/'(x)| < T —- 1/(0)1 . (3.6) 

-oo<x<oo 2 cos (7/2) 

The example f(z) := e 1TZ + 2e 17 e lr2 ' 2 + 1 shows that the estimate is sharp. 

In the rest of the paper, we will present a brief overview of the relevant 
part of the theory of uniformly almost periodic functions that is needed for 
the understanding of Theorem 3.6. For the detail studies, we refer readers 
to [5], [9] and [45]. 

Let / : I -> C be a continuous function, and e an arbitrary positive 
number. A real number t = t(e) = t/(e) is called a translation number of / 
corresponding to e provided that \f{x + 1) — f{x)\ < £ for all real x. 

A set E of real numbers t is called relatively dense if there exists a 
positive number L such that any interval (a , a + L) of length L contains at 
least one number t of the set E. 
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DEFINITION 3 A continuous function / : R — ► C is called uniformly 
almost periodic, u. a. p. for short, if there exists a relatively dense set of 
translation numbers of / corresponding to any given e > 0. In other words, 
for any e > we can find a positive number L = L{e) such that an arbitrary 
interval of length L contains at least one translation number t {e) . 

It is easy to see that if / is u.a.p. then so is cf and f(x + c) for any 
constant c; since | |/(x + t)| — \f(x)\ \ < \f(x + t) — f(x)\, \f\ is u.a.p. if 
/ is; the sum and the product of u.a.p. functions are u.a.p. [5, pp. 2-6, 
12-15]. 

(i) Every periodic function is u. a. p.. (ii) u.a.p. functions are bounded 
on the real line. (iii) A u.a.p. function / is uniformly continuous for 
— oo < x < oo. (iv) The limit function of a uniformly convergent sequence 
of u.a.p. functions on (—00,00) is also u.a.p.. (v) For any u.a.p. function 
/, lirnr_ >00 (l/T) L f(x)dx exists and is finite. It is called the mean value 
of the function / and is denoted by M{f}. Since |/| is u.a.p. if / is, it 
implies that .A4{|/|} also exists for a u.a.p. function /. 

For any A S M and / a u.a.p. function, f(x)e~ is a u.a.p. function. 
Hence 

a(\) = M{f(x) e~ iXx } := lim i / f(x) e~ iXx dx (3.7) 

exists. 

The following result about a(A) [5, pp. 16-18] plays an important role 
in the development of the theory of Fourier series of u.a.p. functions. 

THEOREM 3.7 Let f be a u. a. p. function, and a{\) as in (3.7). Then 
a(X) = except for a countable number of A 's. 

For entire functions of exponential type bounded on real line, Rahman 
& Tariq [45] have proved the following 

THEOREM 3.8 If f is an entire function of exponential type r bounded 
on the real axis, and let A is any real number such that |A| > r. Then 



1 r T 
lim - / f(x) e iXx dx 



So, from the above two theorems we conclude that, if / is an entire function 
of exponential type r, and u.a.p. on the real line, then a(X) = except for 
countably many A's such that |A| < r. 

For a u.a.p. f, the values of A for which a(A) 7^ are called Fourier 
exponents and the corresponding a(A)'s Fourier coefficients of the function. 
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If Ai, A2, . . . , are the Fourier exponents and A n := a(A n ) for n = 1,2,3,... 
corresponding Fourier coefficients, the formal series 

00 
Y^ A n e iAnX = A ± e iAlx + A 2 e iA2X + ■■■ 

n=\ 

is called the Fourier series of /, and we write f(x) ~ Y0n=i A n e x ■ 

An important theorem about the Fourier series of a u.a.p. function is 
the following uniqueness theorem. 

THEOREM 3.9 A u. a. p. is always uniquely determined by its Fourier 
series. 

That is, if f\ and ji are two distinct u.a.p. functions then their Fourier 
series will be different. The next result [5, pp. 21-28] is called "Bohr's fun- 
damental theorem" or "Parseval equation" which is equivalent to uniqueness 
theorem [9]. 

THEOREM 3.10 Let f(x) ~ Y^=i A n e iAnX be any u.a.p. function with 
its Fourier series. Then 

00 

m{|/i 2 } = Ei^i 2 - ( 3 - 8 ) 

It is easy to see that every trigonometric polynomial is u.a.p.. Since the 
limit of a uniformly convergent sequence of u.a.p. functions is u.a.p., the 
limit of a uniformly convergent sequence of trigonometric polynomials is also 
a u.a.p. function. The converse of this result is also true and is contained 
in the follwing Approximation Theorem. 

THEOREM 3.11 Given a u.a.p. function f(x) ~ J2n=i A n e iA " x and a 
positive number e, there exist a trigonometric polynomial P(x) whose expo- 
nents are Fourier exponents of f and which satisfies \f(x) — P(x)\ < e for 
all x in K. 

H. Bohr [9] gave the proof of this important theorem. Later on, S. Bochner 
also proved it by giving an elegant extension of Fejer sum to the class of 
u.a.p. functions. We refer readers to [5], [9], [45] for the proof and other 
details. 

In general, the trigonometric polynomials determined by the partial sums 
of the Fourier series of a function may not converge uniformly to the function 
for all x in BL That is, Fourier series of a function may not represent the 
function in general. It is useful to have conditions under which the Fourier 
series does represent the function. One such condition is given in the next 
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theorem. But before we discuss it, we need to define what is meant by 
linearly independent set of numbers. 

DEFINITION 4 A set a u {y = 1, 2, . . .) of real numbers is called linearly 
independent if for any n the only rational non zero values of n, r2, . . . , r n 
satisfying the equation r\a\ + r2«2 + ■ ■ ■ + r n a n = are r± = ri = ■ ■ ■ = 
r n = 0. 

THEOREM 3.12 For a uniformly almost periodic function f with linearly 
independent Fourier exponents, the Fourier series Y0n=i A n e nX converges 
uniformly for all x. 

In fact, the series formed by the absolute values Y^=i \A n \ converges. By 
virtue of the uniqueness theorem f(x) = Ylri=i A n e nX . It is also known 
[5, pp. 51-52] that the Fourier series X^nLi A n e lXnX of a u.a.p. function is 
absolutely convergent if the Fourier coefficients A\,Ai,... are positive. 

We conclude the paper by presenting the following result due to Rahman 
and Tariq [45], which gives a condition on the Fourier coefficients so that 
the Fourier series represent the function. In fact this is the main result of 
their paper [45] which has been used to prove Theorem 3.6 that provides the 
extension of the theorem of Govil and Vetterlien [32] about self-reciprocal 
polynomials to the entire functions of exponential type. 

THEOREM 3.13 Let f be a u.a.p. function 



f( X )~Y, A " eiAnX 



where the Fourier coefficients A±,A2,.-. lie in a sector of opening 7 G [0 , vr) 
with vertex at the origin. Then Y^=i \An\ < 00 • 
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Numerical approximation to it using parabolic segments 
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Abstract. We derive numerical algorithms that can be used to approximate n. We utilize 
and extend standard recurrence relations that compute the area of inner and outer regular 
polygons. The relations that we derive arise from approximations of area of circular sectors 
by adjoining parabolic segments to triangular subregions of the regular polygons. We also 
discuss the accuracy of the new approach and employ Mathematica software in the present 
work to facilitate computation. 

Preliminary Results 

Archimedes employed perimeters of regular polygons to approximate n; [1] contains a 
summary of his approach. In the present paper we use Archimedes' results on the area of 
parabolic segments to extend several results in [2] which utilized areas of regular polygons 
to approximate n. Let p n , respectively P n , (see fig. 1, n = 3), denote regular polygons 
inscribing and circumscribing the unit circle, each having 2 n+1 sides. We note from [2], 
that both P n and p n can be subdivided into 2 n+l congruent, isosceles triangles each having 
interior angle vr/2 n , at the origin (fig. 4 shows half of triangle). The other two equal angles 
at the circumference are equal, in radians, to 

*_> L 

2 V 2" 

It is also noted that the interior angle is halved for successive values of n. It is known, [2] , 
that from the Taylor series for tan, sin, and cos, we have 



area(P n ) = 2 n+1 tan ( 



a 



2^+1 



IT 1+ h^TT 7T + 



71 \2 1 / 7T \4 2 / 7T \6 17 



(-. 



2 n+ij 3 V2«+ 1 / 15 \2 n + 1 J 315 



J 



(2) area(Pn) = 2« +1 sin (^) cos (^) = 2™ sin (£ 

V \2 n J 6 \2 n ) 120 \2 n ) 5040 
We also have from [2] that 

area(p n ) + 2 • area(P n ) 

(3) § vr 

so that we can write 

12 

(4) 7i = - ■ area{p n ) + - ■ area{P n ) + 0(16 n ) 



1Q -1.2n ^ 
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Figure 1: P 2 , regular polygon of 8 sides. 



which follows from the elimination of second term in the power series in (1,2). Similarly, by 
this process of elimination, we have from (1,2,4) that 



(5) 
(6) 

(7) 



7T 



0(16-") 
- 0{16- n 



- ■ area(p n+1 ) - - ■ area(p n ) 

4 1 

- • area(P n+1 ) - - ■ area(P n 

12 

- • area(P n ) + - • area(p n+1 ) + 0(16 "), 

12 

- • area{p n+1 ) + - ■ area(P n+1 ) + 0(16~ n ), 

1 



(9) = - • area(p n ) + - • area(P n+1 ) + 0(16 "). 

y y 

The foregoing, (5-9) represents the six possible combinations of areas involving pairs of dis- 
tinct terms from {P n ,p n , P n +i,p n +i}. We now show show how to derive (5,7) using areas of 
parabolic segments and employing Archimedes' formula. Consider AOPR, APCR,(G.g. 3.) 
By Archimedes' well-known formula and approximation we have area(cirular sector OPCR) s 
area(AOPR) + 4/3area(APCR). Since [2], 



area(P n ) = 2 



ra+l 



"n, "ra 



tan 



71 



2«+i 



area(p n ) 



2 n-1 s , s 



L, J n 

then by elementary calculations (see fig. 2 for notation) 

tv w 2 n+l (area(A0PR)+ 4/3 -area(APCR)) 

ASr 

3 2 

4 7T 



2 sin 



71 



2 n+l 



in+1 



i-Wi-f 



+ 2 S n _! 



(10) 



in+1 



- sin 

3 2 n + 1 



1 — cos 



7T 



2 n+l 



T sin 



7T 



-areo(p n +i) - -area(p r , 



which is (5). Let scaling factor a satisfy < a < 1. Consider now a parabola passing 
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Figure 2: |PC| = S n+1 , |OQ| = h n , |OC| = 1, |PR| = S n 



through points A, T, S in fig. 3 and the vertex of (convex up) parabola at T on the (sliding) 
line segments AT, TS depending on the value of a such that a — 0, 1 corresponds with T = 
C, Q respectively. Archimedes' result for the area of a parabolic segment and approximation 
yields that 



7T 



2 n+i (AOAS - 4/3 • A ATS) 



2 n+1 b n - 2 r 



-i 4 



b„a 1 - \ / 1 - 



'IV. 



2 n+1 tan 
4 



7T / 4 4 71 

— — 1 a H — a cos — — 

2 n+i V 3 3 2 n+1 



I - -a j area(P n ) + - ■ a ■ area(p n +i] 



Notes : Setting a = 1/2 gives (7). We have so far established a relationship between numerical 
algorithms with a higher order numerical accuracy that in [2] , and, the elimination of terms 
in the Taylor series for the area of the regular polygons. Next, for interest and completeness, 



BOLLMAN-GROSSMAN: NUMERICAL APPROXIMATION TO pi 



239 



M Figure 3: AATS, APCR, APMR, A PKR. 





ZBOC = 



2n+2 



Figure 4: C = |OA| , a = |AB| , b = |BC| , d = |OB| , |OP| = |OC| = 1 



we list new results from [2]: define real numbers a, b, c, d as follows 



7T \ / 71 

a = tan ( - — r^ I — tan 



tan 



c = sec 



d = sec 



2 n+l 

71 

2 n+lJ ' 
2 n+2j 



2 n+2 
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In [2] , (figure 4) we have obtained the following polynomial relations 

b 3 + ab 2 + b - a = 0, 

(?-(?- a 2 c - a 2 = 0, 

d 6 - {a 2 + l)d 4 + Aa 2 d 2 - Aa 2 = 0. 

Algorithms and Numerical Results 

For convenience, we adopt a coordinate system with the origin at the center of the unit 
circle and the midpoint of one side of p n , the inscribed 2 n+1 — gon on the positive y-axis. We 
approximate the circle by the parabola interior to the circle and passing through the three 
points (0,1) and (±a, Vl — a 2 ), corresponding to points C,R,P respec. in fig. 3, where the 
value of a depends on the number of sides of the inscribed polygon. 

These three points are common to the circle and the parabola. It is well-known (see, for 
example, [3], p. 524) that two distinct conic sections can intersect in no more than four 
points. Taking the symmetry of the coordinate system into account, there can be no other 
intersection points, and thus the parabola remains inside the unit circle for all other values 
of x between —a and a. 

We compute the area of the parabolic wedge bordered by the approximating parabola and 
the two line segments from the origin to the points (±a, \/l — a 2 ) by dividing it into two 
right triangles and a parabolic segment. The right triangles each have area a- \/\ — a 2 , which 
arises from the classical formula for the approximation of 7r by inscribed 2 n -gons. Denote by 
Sn,n > 1> the side length of a 2 n+1 -gon inscribed in the unit circle; then la = s n . 

It follows by substitution that the two right triangles have combined area A n , where 



For the parabolic segment, we make use of the result of Archimedes that the area of a 
parabolic segment is equal to | of the area of the triangle with the same base and height. 
The area a n of the parabolic segment is then 

a n = -■ ~bh, 

where b denotes the base and h the height of the triangle. We have b = s n . The height h is 
one leg of a right triangle with other leg s n /2 and hypotenuse s n+ i. It follows that 



/'=« + i-' S " 



4 



and we have 



s 2 

2 n 



3 s„ys n+1 - ] 



The area of the unit circle is thus approximated by the sum of 2 n of these parabolic wedges: 
n w T ■ (a n + A n ) = 
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The sequence {s„} of side lengths is given by 



'n+l 



X 4 - si 



with s 2 = V2 (see, for example, [2]). We note further that s n = 2sin(7r/2™ +1 ). 

Numerical evidence, computed with Mathematica, gives the following sequence of approx- 
imations to 71. Bold digits are exact. 



n 


2 n+l 


^n 


Approximation to tt 


% error 


1 


4 


1.414214 


3.1045694997 


-1.18 


2 


8 


.765367 


3.1391475703 


-.078 


3 


16 


.390181 


3.1414377167 


-.0059 


4 


32 


.196034 


3.1415829366 


-3.09 x 10" 4 


5 


64 


.098135 


3.1415920458 


-1.93 x 10~ 5 


6 


128 


.049082 


3.1415926156 


-1.21 x 10~ 6 


7 


256 


.024541 


3.1415926512 


-7.56 x 10~ 8 


8 


512 


.012272 


3.1415926534 


-4.72 x 10~ 9 


9 


1024 


.006136 


3.141592653580 


-2.95 x 10~ 10 


10 


2048 


.003068 


3.1415926535892 


-1.85 x 10- 11 


11 


4096 


.001534 


3.14159265358976 


-1.15 x 10 12 


12 


8192 


.000767 


3.141592653589791 


-7.21 x 10~ 14 


13 


16384 


.000383 


3.1415926535897931 


-4.51 x 10~ 15 



This sequence of approximations to re is, owing to the nature of the approximating parabo- 
las, an underestimate. We seek next to find a corresponding upper bound for tt through a 
sequence of parabolic sectors. 

We begin by constructing the tangent lines to the unit circle at the endpoints of the 
inscribed 2 n+1 -gon p n . The symmetry of the coordinate system means that these lines will 
intersect at a point (0, y) on the positive y-axis, and we then consider the parabola passing 
through the three points (0,y) and (±0, \/l — a 2 ), corresponding to points M,R,P respec. 
in fig. 3, where the value of a depends on the number of sides of the inscribed polygon. 

Routine calculation of the intersection point reveals that y = i-^ . We calculate the area 
of the parabolic sector as the sum of two areas involving triangles: 

4 

^sector TT top triangle T" ^interior triangle 



The top triangle has area ^s n ( J—^ — \f\ 
\s n \/\ — a 2 . Using the relation a = s n /2 gives 



A 



-sector 



= Sr 



and the interior triangle has area 



V4-ST 



v 3v/4^7 12 

and by adding up all 2 n+1 such sectors, the approximation 



Ti 



■tn+1 



V4~ 



3^4" 



12 
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Numerical computation yields the following overestimates to 7r: 



n 


2 n+l 


s n 


Approximation to 7r 


% error 


1 


4 


1.41421356 


4.66666667 


48.544614 


2 


8 


0.76536686 


3.47546896 


10.627613 


3 


16 


0.39018064 


3.22297468 


2.590470 


4 


32 


0.19603428 


3.16181816 


.643798 


5 


64 


.09813535 


3.14664168 


.160716 


6 


128 


.04908246 


3.14285445 


.040164 


7 


256 


.02454308 


3.14190808 


.010040 


8 


512 


.01227177 


3.14167151 


.002510 


9 


1024 


.00613591 


3.14161237 


.000627 


10 


2048 


.00306796 


3.14159758 


.000157 


11 


4096 


.00153398 


3.14159389 


.000039 


12 


8192 


.00076699 


3.14159296 


.000010 



We see that this estimate is not as accurate as our previous underestimate. In an effort to 
improve the accuracy, we change the vertex of the parabola, to the midpoint of the segment 
connecting (0,1) to the intersection point (0, -/j==f)- This gives 



0, 



l + v / T^~ 

2\/r^ 



as the third point on the parabola, which corresponds to point K in fig. 3. Recalculation 
using 

4 

^sector 7T top triangle ~r -^interior triangle 

gives a new formula for the area of a sector as 



4s, 



A 



sector 



(l + v^ 



< )+< 



YLyfi 



and a new estimate for n: 



Tl^T 



x As n (l + v/4^l) + si 



12 J A- si 



Another set of Mathematica calculations yields the following, more accurate, set of over- 
estimates to 7r: 
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n 


2 n+l 


S n 


Approximation to tt 


% error 


1 


4 


1.41421356 


3.88561808 


23.6841 


2 


8 


0.76536686 


3.30370826 


5.27489 


3 


16 


0.39018064 


3.18220620 


1.29277 


4 


32 


0.19603428 


3.15170055 


.321744 


5 


64 


.09813535 


3.14411686 


.080348 


6 


128 


.04908246 


3.14222353 


.0200816 


7 


256 


.02454308 


3.14175036 


.0050201 


8 


512 


.01227177 


3.14163208 


.00125499 


9 


1024 


.00613591 


3.141602510 


3.13747 x 10~ 4 


10 


2048 


.00306796 


3.14159512 


7.84364 x 10~ 5 


11 


4096 


.00153398 


3.14159327 


1.96104 x 10~ 5 


12 


8192 


.00076699 


3.141592807 


4.90102 x 10~ 6 



Our next attempt at an accurate overestimate considers the concave-up parabola passing 
through the three points (±b n , 1) and (0, h), which are points S, A, T respec. in fig. 3, where 
b n represents the side length of P n , a 2 ra+1 -gon circumscribed about the unit circle, and h is 
to be determined. 

This parabolic wedge can also be subdivided, into a large isosceles triangle from which a 
parabolic segment bounded above by the circumscribed polygon and below by the parabola 
has been removed. Denoting the area of the wedge by A n , we then have 

An = \b n - \b n ■ (1 - h) = -J(4/i - 1). 
2 6 6 

Once again, we have a recursive formula (from [2]) for {b n }: 62 = 2 and, for n > 2, 

b„ 



>n+l 



b \ 2 



i + Vi + (¥) 



Using this expression for b n yields 



71 



)n+l "n 

6 



(4/i-l). 



It remains to determine the value of h. Taking h =the length of the segment connecting the 
origin to the midpoint of one side of the inscribed 2 ra+1 -gon gives 



h 



and thus 



7T 



->n+l 



6 




1 



Numerical work with Mathematica reveals that this formula is an underestimate of it, and 
it is less accurate than our first approximation. 
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n 


2 n+l 


b n 


Approximation to 7r 


% error 


1 


4 


2.000000 


2.437903 


-22.40 


2 


8 


.828427 


2.977387 


-5.23 


3 


16 


.397825 


3.101061 


-1.29 


4 


32 


.196983 


3.131490 


-.322 


5 


64 


.098254 


3.139069 


-.080 


6 


128 


.049097 


3.140962 


-.002 


7 


256 


.024545 


3.141435 


-5.02 x 10~ 3 


8 


512 


.012272 


3.141553 


-1.25 x 10~ 3 


9 


1024 


.006136 


3.141583 


-3.14 x 10" 4 


10 


2048 


.003068 


3.141590 


-7.84 x 10~ 5 


11 


4096 


.001534 


3.1415920 


-1.96 x 10~ 5 


12 


8192 


.000767 


3.1415925 


-4.90 x 10~ 6 


13 


16384 


.000383 


3.14159261 


-1.22 x 10~ 6 



Taking h — 1 gives an overestimate, since then the parabola becomes the side of the 
circumscribing polygon and the 2 n+1 wedges exactly fill that polygon. 

If we split the difference and take h to be the y-coordinate of the midpoint (a = 1/2, sect. 
1, point T in fig. 3) of the segment joining the midpoint of the side of an inscribed 2 n -gon 
to the point (0,1), we have 




l-i/l 



or 



2 2V V2 



With this value, our approximation to it becomes 



ir^T 



■ 1 .^|l + 2jl 



which is revealed by Mathematica to be an accurate overestimate of it, with accuracy com- 
parable to that of our underestimate above. 
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n 


2 n+l 


Approximation to 7r 


% error 


1 


4 


3.218951 


2.46 


2 


8 


3.145548 


.126 


3 


16 


3.141829 


.008 


4 


32 


3.141607 


4.66 x 10" 4 


5 


64 


3.141594 


2.91 x 10~ 5 


6 


128 


3.1415927 


1.81 x 10~ 6 


7 


256 


3.141592657 


1.13 x 10~ 7 


8 


512 


3.1415926538 


7.09 x 10- 9 


9 


1024 


3.1415926536 


4.43 x HT 10 


10 


2048 


3.14159265359 


2.77 x 10~ n 


11 


4096 


3.1415926535898 


1.73 x 10~ 12 


12 


8192 


3.141592653589796 


1.08 x 10~ 13 


13 


16384 


3.1415926535897934 


6.76 x 10~ 15 



We have examined the sensitivity of the accuracy of this approximation to the value of 
the constant \. Re-calculation with values running through 2 .t" 1 for k — 2, 3, . . . , 16 show 



2 k 



less accuracy than \. At the same time, re-calculation with 



2 k 



for k — 2, 3, . . . , 16 yields 



a sequence of overestimates to n that are less accurate than that achieved with \. 
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ON THE UNIFORM SPECTRUM OF BOUNDED FUNCTIONS AND 
APPLICATIONS TO DIFFERENTIAL EQUATIONS 

NGUYEN VAN MINH, GISELE M. MOPHOU, AND GASTON N'GUEREKATA 

Abstract. This paper is a survey of the (new) concept of uniform spectrum of bounded 
functions. We review its properties and relationship with classical concepts of spectra of 
bounded functions, and some applications to differential equations in Banach spaces. 



1. Introduction and Notations 

This paper is a brief survey of the concept of uniform spectrum of bounded functions 
introduced in 2004 in the literature by Diagana, Minh and N'Guerekata [4] as an essential 
tool in the study of the existence and uniqueness of an almost automorphic mild solution 
to the linear differential equation 

(1) u'(t) =Au(t) + f(t), teR, 

where A is a (unbounded) linear operator which generates a holomorphic semigroup of 
linear operators on a Banach space X, and the input / is almost automorphic (in Bochner's 
sense). The difficulty in this case is that the classical semigroup theory methods developed 
by Minh, Naito, Batty and others, do not apply since the existence of a mild solution of 
this equation is not guaranteed; the main reason is that the group of translations is not 
necessarily strongly continuous in the case of almost automorphic function. The concept 
of uniform spectrum of bounded functions was introduced to overcome this obstacle. And 
it turns out to coincide with the well known Carleman spectrum as we will see below. 
This yields a much broaden applications field of the concept of the uniform spectrum . 
Further studies in this direction can be found in [6, 7, 8, 14]. 

Before proceeding, let us fix our notations. Let A be a linear operator on a complex 
Banach space X. In what follows, D(T), <j(A) and p(A) will denote the domain, spectrum 
and the resolvent set of A, respectively. In particular, <Ji(A) stands for cr(A) fl iR. The 
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field of complex numbers is denoted by C. 3ftz and Qz denote the real and imaginary 
parts of a given complex number z, respectively. We denote by T the unit circle in the 
complex plane. The notation BC(R,X) stands for the space of all X-valued bounded 
and continuous functions on R. We will denote by BUC(M, X) the subspace of BC(M, X) 
consisting of all uniformly continuous and bounded functions. We set BC l (WL, X) := {/ G 
BC{R,X)\3f G 5C(R,X)} 

Let M. be a closed subspace of BC(W, X). The operator Am of multiplication by A is 
defined on D(A M ) := {g E M : g(t) G D(A) \/t G R,Ag(-) G M}, and A M 9 ■= Ag(-) for 
all g G D(A M )- 

1.1. Almost Automorphic Functions. 

Definition 1.1. (Bochner) A function / G C(R, X) is said to be almost automorphic if 
for any sequence of real numbers (s' n ), there exists a subsequence (s n ) such that 



(2) lim lim f(t + s n - s m ) = f(t) 
for any i£l. 

The limit in (2) means 

(3) g(t)= lim f(t + s s ) 

n— >oo 

is well-defined for each t G R and 

(4) f(t) = lim g{t - s n ) 

n— >oo 

for each t G R. 

Remark 1.2. It is clear from the definition above that constant functions and continuous 
periodic and almost periodic functions are almost automorphic. 

Remark 1.3. Unlike almost periodic functions, an almost automorphic function may not 
be uniformly continuous (see examples below) 

Remark 1.4. Because of pointwise convergence the function g is measurable but not neces- 
sarily continuous. It has been proved that if g is continuous, then / is uniformly continuous 
(cf [13]). 

Example 1.5. The following are classical examples of almost automorphic functions. 
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(i) (Veech) The function / : R — >■ T (the unit circle in C defined by 

9 _i_ pit I f,i\/2t 

f{t) : = 2 + e +e _ 

is almost automorphic but not almost periodic, 
(ii) (Levitan) The function / : R — >■ R defined by 

/(t) := sin- 



2 + cost + cosylt 
is almost automorphic but not almost periodic since it is not uniformly continuous 

If the limit in (3) is uniform on any compact subset K C R, we say that / is compact 
almost automorphic. 

Theorem 1.6. Assume that f , f\, and ft are almost automorphic and A is any scalar, 
then the following hold true. 

(i) Xf and f\ + fi are almost automorphic, 

(ii) f T (t) := f(t + t), iGl is almost automorphic, 
(iii) f(t) := f(—t), t E R is almost automorphic, 
(iv) The range Rf of f is precompact, so f is bounded. 

Proof. See [12, Theorems 2.1.3 and 2.1.4], for proofs. □ 

Theorem 1.7. If {f n } is a sequence of almost automorphic H-valued functions such that 
/„4/ uniformly on R ; then f is almost automorphic. 

Proof, see [12, Theorem 2.1.10], for proof. □ 

Remark 1.8. If we equip AA(K), the space of almost automorphic functions with the sup 
norm 

ll/l|oo=SU P ||/(t)|| 

tern. 
then it turns out to be a Banach space. If we denote KAA(K), the space of compact 

almost automorphic X- valued functions, then we have 

AP(X) C KAA(X) c AA(X) C BC(R,X) 

Theorem 1.9. If f G AA(X) and its derivative f exists and is uniformly continuous on 
R, then f e AA(X). 

Proof. See [12, Theorem 2.4.1] for a detailed proof. □ 
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Theorem 1.10. Let us define F : R H> X by F(t) = /„* f(s)ds where f G AA(X). Then 
F G AA(X) iff R F = {F(t)\ teR} is precompact. 

The monograph [12] presents detailed information about almost automorphic functions 
along with some applications to differential equations. 

2. Spectrum of bounded functions 

In this Section, we recall briefly the classical theory of spectrum of functions from 
Harmonic Analysis standpoint. 

2.1. Bohr spectrum. A natural extension of Fourier exponents of periodic functions to 
almost periodic functions is the notion of Bohr spectrum. 

Consider an almost periodic function / : R — > X . It is well-known that for every AgI, 
the average 

a(/,A):= lim -L f e - M f(t)dt 

exists and is not zero at most at countably many points A. 

The set cr b (f) :={Ag1: a(f, A) 7^ 0} is called the Bohr spectrum of /. 

2.2. Carleman spectrum. Let u G L°°(M, X). The Carleman transform u of u is defined 
by 

J °° e- xt u(t)dt (ReX > 0) 
u{\) := < 

- f °° e xt u{-t)dt (ReX<0), 
Thus u is a holomorphic function defined on C\zR. 

Definition 2.1. A point z£ G iR is called regular for u if u has a holomorphic extension 
to a neighborhood of i£, i.e. there exists an open neighborhood U of it; and a holomorphic 
function h: U -)■ X such that h(X) = u(X) for all A G U\iR. 
The Carleman spectrum sp c (u) of u is defined by 

sp c (u) := {£ G R : z£ is not regular}. 



For each u G i?C(IR,X) we let M. u := span{S(r)u,r G IR} (here S(t) denotes the 
translation BC(R, X) 3 /(•) H> /(r+-) G SC(R, X)). It is a closed subspace of BC(R, X). 
Note that if 
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• u G BUC(M.,X), the Carleman spectrum of u coincides with its Arveson spec- 
trum, defined by (see [2, Lemma 4.6.8]) 

(5) i sp c (u) = a(V u ). 

where T> u is the infinitesimal generator of the restriction of the group of transla- 
tions (S(t)\Mu)tm to the closed subspace A4 U . 



• If / is almost periodic, then sp c (f) = <J b (f) 
Proof. See Proposition 1.2 [8] □ 

Below we list some properties of the Carleman spectra of functions. We refer the reader 
to [2] for more details and information on other properties of the Carleman spectrum. 

Proposition 2.2. . Let u,u n ,v G BC(R, X) such that linin^oo \\u n — u\\ = 0. Then 
(i) sp c (u) is closed, 
(ii) sp c (u + v) C sp c (u) U sp c (v), 
(iii) If sp c (u) = 0, then u = 0, 
(iv) If sp c (u n ) C A, \/n, then sp c (u) C A. 

Proposition 2.3. Let u e L°°(R,X). 

(i) If sp c (u) = {0}, then u is constant, 
(ii) If sp c (u) is finite, then u is a trigonometric polynomial. 

Theorem 2.4. Let u G BUC(W, X). If sp c (u) is discrete, then u is almost periodic. 

2.3. Beurling spectrum. 

Definition 2.5. Let -u G L°°(R, X). The Beurling spectrum spb{u) of -u is te set 

sp B (u) := {£ G R : Ve > 3v G L X (IR) si. supp{v C (£ - e, £ + e) and w * v ^ 0} 

The following relation between the Carleman spectrum and the Beurling spectrum 
holds. 

Theorem 2.6. Let u G L°°(R.X). Then 

sp c (u) = sp B (u). 

Proof. See [2, Propostion 4.8.4]. □ 

Now we present a (new) notion of spectrum of bounded functions from Differential 
Equations standpoint. 
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3. Uniform spectrum of a function in BC(R,X) 

Consider the evolution equation 

(6) d ^ = Au(t) + f(t), 

where A generates a Co-semigroup (T(t)) t >o on a Banach space X, / is an almost periodic 
function. 

Theorem 3.1. If (T(t))t>o is analytic, f is almost periodic and 



a(A)ma F (f) 



then there is a unique almost periodic mild solution u such that Ob(u) C (7f (/); here Cp{f) 
denotes the set of frequencies of f. 

This important result is now a classical one. Our concern is to extend it to the case 
where / is almost automorphic. But first, let's make the following remarks. 

Remark 3.2. In all methods used to study this problem, the uniform continuity of / plays 
an important role. For instance that yields: 

• The strong continuity of the translation group; 



. a(V f ) = ia F {f) 
where T>f is the differential operator on the closed subspace of BC(R, X) gener- 
ated by /(* + •), t e R. 

Since an almost automorphic function may not belong to BUC(R,X), the con- 
dition 

a(A) n isp c (f) = 

is no longer sufficient for Eq(2) to have a unique mild solution. 

To fix this problem, Diagana, Minh and N'Guerekata introduced the concept of uniform 

spectrum as an extension of the Carleman spectrum in their pioneering paper [4] as follows. 

Consider the following simple ordinary differential equation in a complex Banach space 

X 

(7) x'{t) - Xx = /(*), 

where / G BC(K). If 9ftA ^ 0, the homogeneous equation associated with this has an 
exponential dichotomy, so, (7) has a unique bounded solution which we denote by Xf^(-). 
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Moreover, from the theory of ordinary differential equations, it follows that for every fixed 

£gR, 



(8) 



x fAO 



(9) 



/i e H *~ t] f(t)dt (if i?eA<0) 



- //° e A «-*)/(t)d« (ifi?eA>0). 
f^e-^m + vWv (if i?eA<0) 



Jo 



-Xr, 



f(Z + V )dv (ifi?eA>0). 



As is well known, the differentiation operator D is a closed operator on BC(M., X). The 
above argument shows that p(T>) D C\iR and x/,a = (^ — A) -1 / for every A G C\iR and 
f EBC(R,X). 

Hence, for every A G C with 9£A ^ and / G BC(R, X) the function [(A - V)- l f](t) = 
S{t)f(X) G BC(R,X). Moreover, (A - V)- 1 f is analytic on C\iR. 

Definition 3.3. Let / be in BC(R,X). Then, 

(i) aGi is said to be uniformly regular with respect to / if there exists a neighbor- 
hood U of ia in C such that the function (A — V)~ l f, as a complex function of A 
with 9ftA 7^ 0, has an analytic continuation into U. 
(ii) The set of £ G R such that £ is not uniformly regular with respect to / G _BC(R, X) 
is called uniform spectrum of / and is denoted by sp u (f). 

If / G BUC(R, X), then a G R is uniformly regular if and only if it is regular with 
respect to /. In fact, this follows from (5) via the identity 



R(X,V f )f 



e-*S(0fdt, KA^O. 



(Here the integral is taken in the space BUC(W, X)). For / G BC(R, X), in general, we 
do not know if the above (5) holds. We now list some properties of uniform spectra of 
functions in BC(R,X). 

Proposition 3.4. Let g, f, f n G BC(R, X) such that f n — > f as n — > oo and let A C R be 
a closed subset satisfying sp u (f n ) C A for all n G N. Then the following assertions hold: 

(i) sp u (f) = sp u (f(h + -)); 

(ii) sp u (af(-)) C sp u (/), ot G C; 
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(iii) spctf) C sp u (f); 

(iv) sp u (Bf(-)) C sp u (f), B G L(X); 

(v) sp u (f + g) C sp u (f) U sp u (g); 

(vi) sp u (f) C A; 

(vii) If / 6 BC\R,X), then S p tt (/') C sp u (f). 

Proof. For the proof see [3] and [4] . □ 

Corollary 3.5. For any closed subset AcR, the set A n (X) := {/ G BC(R,X) : sp u (f) C 
A} is a closed subspace of 5C(R,X) which is invariant under translations. 

Proof. Obvious. It suffices to use the properties of uniform spectrum above. □ 

We also have the following 

Lemma 3.6. Let A be a closed subset of R and let T>^ u be the differentiation operator 
acting on A U (X). Then we have 

(10) a(V Au ) = iA. 

Proof. Since the function g a defined by g a (t) :— e iat x, « G M, i G M, x ^ 0, is in A U (X) 
and sp u (g a ) = sp(g a ) = {a} we see that ia G a(T> Au ), that is, iA C a(T>\ u ). 
Now let us prove the converse. For (3 G R\A we consider the equation 

(li) 0g-gf = f, feA u (X). 

We will prove that (11) is uniquely solvable for every / G A U (X). This equation has at 
most one solution. In fact, if gi,g2 are two solutions, then g — g\ — g-i is a solution of 
the homogeneous equation, that is for / = 0. Taking the Carleman transform of both 
sides of the corresponding equation, we see that sp c (g) C {/?}. Since g G A U (X) we have 
s Pc{g) C A. Combining these facts we have sp c (g) = 0, that is g — 0. 

Now we prove the existence of at least one solution to Eq. (11). In fact, for 9ftA 7^ 
Eq. (11) has a unique solution which is nothing but (A — T>)~ 1 f. Since i(3 G" sp u (f), by 
definition, the function (A — T>)~ 1 f, defined on C\iR, has an analytic continuation into 
a neighborhood of i/3. In particular, the following limit exists lim^j^A — T>)~ 1 f := go. 
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We are going to show that g is a solution of (11) and go G A M . Indeed, since 

(tf-PXA-P)- 1 / = ((i(3-X) + (X-V))(X-V)- 1 f 

= (^-A)(A-P)-V + (A-^)(A-P)-V 
= (;/? - A)(A - P)" 1 / + /, 
we have 

(12) \imUp-V)(\-V)- 1 f = f. 

A— np 

Using the closedness of the operator {i(3 — T>), we come up with go being in the domain 
of i/3 — T> and {i(3 — T>)go = f. Next, to show that go G A U (X), in view of Corollary 3.5 it 
suffices to show that for every 3ftAo 7^ 0, the function (Ao — ^P) -1 / is in A U (X). Since both 
A and Ao are in p(T>), and 

(A - V)-\X - V)- 1 / = (Ao - V)-\X - V)- 1 / 

we see that (A — P) _1 (Ao — P) -1 / has an analytic continuation into a neighborhood of 
i(3. This completes the proof of the lemma. □ 

It turns out that the uniform spectrum of a function / G BC(W, X) coincides with its 
Carleman spectrum, as shown via this Proposition due to Liu, N'Guerekata, Minh and 
Vu ([7]): 

Proposition 3.7. Let / G BC(R,X). Then 

(13) s Pu (f) = sp c (f). 
Proof. First we show that 

(14) sp u (x fM ) = sp u (f), 

where Xf t \ is defined by (8), and Ao is a given complex number such that 9ftAo 7^ 0. In 
fact, we have, for every 9ft A 7^ 

(A - V)- l x fM = -(A - V)-\X - V)- l f = -(A - V)-\X - V)- l f. 

So, (A — T>)~ 1 Xf j \ has an analytic continuation into a neighborhood of i(3, where (3 is a 
real number, if and only if so does (A — D) -1 /. That is (14) holds. Note that since the 
derivative of Xf^ is bounded, this function is uniformly continuous, so, 



spc(f) C sp u (f) = sp u (x fM ) = sp c (x f . 



Ao, 
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To complete the proof of this proposition, it suffices to show that 

(15) (»Wc(/)) C (R\sp c (x fM )). 

To this end, we will use the Beurling spectrum as an alternative of the Carleman spectrum 
(Theorem 3.6). That is, £ G (M\sp c (f)) , if and only if there is a positive e such that if 
<p G L X (IR) with the support of its Fourier transform supp(<p) is contained in (£ — e, £ + e), 
then (j) * f — 0. Next, it can be easily checked that 

* XfM = <t> * (A - vy x f = (A - V)~ V * / = 0. 

This shows that if f G (R\sp c (/)), then £ G (R\sp c (x fM )). That is (15) holds. This 
completes the proof of the proposition. □ 

3.1. Applications to differential equations. This Section is a slight variation of [4]. 
We denote by T := {KAA(H), AA(H)}. Now let us consider (2) where A is a (unbounded) 
linear operator which generates a holomorphic semigroup of linear operators on a Banach 
space X and / G T . 

As described earlier, the main difficulty that arises here is concerned with the non- 
uniform continuity of almost automorphic functions. This implies the non-strong conti- 
nuity of translation semigroup in the functions space T. Therefore, many elegant proofs 
using semigroup theory fail. To overcome this difficulty we will use the concept of uniform 
spectrum. 

For any closed subset Aclwe denote 

T k := {u G T : sp u {u) C A}. 

By the basic properties of uniform spectra of functions, Tk is a closed subspace of 
BC(W, X). Denote now by T>\, the part of the differential operator d/dt in JF A . We 
have the following: 

Lemma 3.8. 

(16) a{V A ) = iA. 

Proof. The proof follows the one of Lemma 2.5 [4]. □ 

Now consider the operator A\ of multiplication by A and the differential operator d/dt 
on the function space J-\. 
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By definition the operator A\ of multiplication by A is defined on D(Aa) '■= {g G Ts. '■ 
g(t) G D(A) Vt G R,Ag{-) G F A }, and Ag := Ag(-) for all g G D(A A ). We have the 
following important result 

Theorem 3.9. Assume that Act «s closed. Then the operator A\ of multiplication by 
A in JFa is the infinitesimal generator of an analytic Co-semigroup on T A - 

Proof. We will prove that A\ is a sectorial operator on JF A . In fact, first we check that 
A\ is densely defined. Consider the semigroup 7a (t) of operators of multiplication by 
T(t) on JF A . We now show that it is strongly continuous. Indeed, suppose that g G !F\, 



since R(g) is relatively compact we see that the map [0, 1] x R(g) 3 (t,x) i— ¥ T(t)x G X 
is uniformly continuous. Hence, 

sup\\T(t)g(s)-g(s)\\^0 

seR 

as t — >■ 0, i.e., the 7a {t) is strongly continuous. By definition, g G D(A\) if and only if 
g(s) G .0(A), Vs G R and A#(-) G J^a- Thus, 

r Wg W- 9 (.) = i f mAgisH , vt >o, s6R . 

Therefore, 

lim^ll ^-'W - I f'T(0A 9 ( s)m =0, 

i.e., g is in D(G), where G is the generator of T\(t) and A\g = Gg. Conversely, we can 
easily show that G C A&. 

Now it suffices to prove that ct(Aa) C o~(A) to claim that Aa is a sectorial operator. In 
fact, let /i G p(^4). To prove that fi G p{Aa) we show that for each h G JF A the equation 
fig — A\g = h has a unique solution in Ts.- 

But this follows from the fact that (fi — .Aa) _1 M') G ^a and that the equation 

fix — Ax = y 

has a unique solution x in X for any y G X. □ 

Now observe as in [4] that if A be the generator of an analytic semigroup, then the 
operator A\ of multiplication by A and the differential operator T>\ on T\ are commuting, 
i.e. 

(17) R(1,V a )Ta(t) = Ta{t)R(1,V a ), Vt > 0. 
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So, by the spectral properties of sums of commuting operators, we have the following 
result. 

Theorem 3.10. If a (A) fl i\ = 0, then for every f G Ta there exists a unique u G J- a 
such that 



A A + V A u = f. 
Proof. Since ,4a and T>a commute and satisfy Condition P, the sum A\ + T>\ is closable 



(denote its closure by .Aa + V\). 

Using the fact that cr(A) fl zA = and properties of commuting operators , we see 



thatn G p(A A + V/ 



Therefore for every / G Ta there exists a unique u G D(Aa + V>a) such that 



A A + V A u = f. 

□ 
Now we will relate it with the notion of mild solutions to evolution equations. 



Lemma 3.11. Let u } f G T . If u G D(A\ + V\) and Aa + I^a u — f, then u is a mild 
solution of Eq. (4-11)- 

Proof. In fact the lemma follows immediately from the following: 

For every u G T we say that it belongs to D(L) of an operator L acting on T if there 
is a function / G T such that 

(18) u(t) = T(t-s)u(s) + J T(t-0f(0<%, Vt>s,t,8ER. 

By a similar argument as in the proof of [10, Lemma 3.1] we can prove that L is a closed 
single-valued linear operator acting on T which is an extension of Aa + £>A- 



Thus, L is an extension of ,4a + ^V- This yields that u is a mild solution of Eq. 

(4.11). □ 

As an immediate consequence of the above argument we have: 

Theorem 3.12. Let A be the generator of an analytic semigroup and let A be a closed 
subset o/R. 

Then it is necessary and sufficient for each f G Ta there exists a unique mild solution 
u G Ta to Eq. (4-11) that the condition o-(A) fl iA = holds. 
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Proof. The sufficiency follows from the above argument. The necessity can be shown as 
follows: 

For every (gA, obviously that the function h : R 3 t i— > ae l & is in JF A , where a G X is 
any given element. 

By assumption, there is a unique g G D(A\) such that i£g(t) — Ag(t) = h(t) for all 

teR. 

One can easily show that g(t) is of the form be l &. Hence, b is the unique solution of the 
equation i£b — Ab = a. 

That is i£ £ a(A A ), so zA n cr(v4 A ) = 0. D 

Theorem 3.13. Let A be the generator of an analytic semigroup such that o~(A) l~l 
i s Pu (f) = 0. 

Then Eq. (4-11) has a unique mild solution w in T such that sp u (w) C sp u (f). 

Proof. Set A = sp u (f). Then by the above argument we get the theorem. □ 

Remark 3.14. Note that Theorem 4.13 is an extension of Theorem 4.1. Indeed if / is 
almost periodic (hence uniformly continuous), then 



°fU) = S Pc(f) = SPu(f)- 

A version of this theorem can be stated for bounded solutions that are not necessarily 
uniformly continuous. See [7]. 
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Abstract 

Sequential procedures are concerned with statistical analysis of 
data when the number of observations is not predetermined. 
The purposes of the present paper are [4] : 

1. the description of the SPRT (SPRT= Sequential Probability Ra- 
tio Test) in the case of exponential patterns; 

2. the evaluation of the mean number of samples for the proposed 
test in our case; 

3. the obtaining of signal to noise ratio which defines a measure of 
two classes separability for the patterns with exponential distri- 
bution) . 

AMS Subject Classification: 62-xx, 62Lxx, 62L12. 

Keywords: Sequential Probability Ratio Test, sequential testing, 
statistics hypothesis, sequential procedures, sequential classification, 
exponential patterns, samples, signal to noise ratio. 

1 Introduction 

1.1 SPRT for Exponential Type Densities 

Let observable random variables Y±, Y2, . . . which are independent and 
identically distributed, belonging to u\ or u>2 class. 
We want to test the hypotheses 

Hq : X G uj\ against Hi : X G 102- 
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The probability density functions for the two classes are of exponential 
type: 

f(X; do) = e c(x)N(8 ) + D(x )+ M(e )^ for c{x) < 
in the case of first class and 

f(X; 9 1 ) = e C(X)N(8 l)+ D(X) + M( ei )^ for c{x) < 

in the case of second class, where the functions N(9i), M{9i) satisfy the 
condition 



J f(X; 9i)dX = l, * = 0,1. 



Suppose we observe at first X\. 

For the first stage we shall have the likelihood ratio: 

f{Xr, 6>i) e c(x 1 )N(e 1 )+D(x 1 )+M(e 1 ) 

Zl = ^ f(Xr, O ) = ^ e C(X 1 )N(8 )+D(X 1 )+M(e ) ; 

therefore 

zi = C(X 1 )[N(e 1 ) - N(9o)} + M(9 1 ) - M(9 ). 

For the stopping bounds 0<S<^4<cx)we define the sequential 
probability ratio test at the first stage as: 

i) accept Ho if z\ < B, namely 

C(X 1 )[N(9 1 ) - N(9o)\ + M(0i) - M(0 O ) < B 
or 

C(Xi) < [JV(0i) - N(9 )]- 1 [B - M(9 1 ) + M(9 )}; 

ii) reject Ho if z\ > A, namely 

C(X 1 )[N(9 1 ) - N(9 )} + M{9 1 ) - M(9 ) > A 



or 



C(Xi) > [N(6i) - iV^o)]" 1 ^ - M{9 1 ) + M(9o)}; 
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iii) continue sampling with X2 if B < z\ < A, therefore 

B < C(Xi)[W(0i) - N(d )] + M(0i) - M(0 O ) < A 

or 

U < C{X 1 ) < V, 

where 

U = [N(6 1 ) - iV^o)] -1 ^ - M(0i) + M(0 O )], 

V = [iV(^) - iV(0o)] _1 [ s - M(0i) + M(9 )}. 
The likelihood ratio at the n stage is 

n n 

Z n = Y,z t = n[M(0i) - M(0 O )] + [JV(^i) - iV(0 o )] J^ C(^) (1) 

and the sequential probability ratio test becomes: 
i) accept Ho if 

n 

Y.CiXi) < [JV(0i) - AWr 1 ^ - n(M(0i) - M(0 O ))]; 
i=i 

ii) reject Ho if 

]T CpQ) > [JV(6»i) - iV(0 o )] _1 L4 - n (Af(0i) - M(0 O ))]; 

i=l 
iii) continue sampling with X n+ i if 

n 

VF < X^ C(X 4 ) < T, 

4 = 1 

where 

W = [iV(^) - NiOo)}- 1 ^ - ra(M(0i) - M(9 ))], 

T = [JV(0!) - Ar(^)]- 1 ^ - n (M(0i) - M(0 O ))]. 
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2 Description of Sequential Probability Ratio Test 

2.1 SPRT in case of the pattern having an exponential dis- 
tribution 



A -e- A ° x , for X > 0, 
0, otherwise 



Consider the following probability density functions for the two classes: 

f(X; A ) = 
in the case of first class and 

f(X; Ai) = 



Ai-e" AlX , for X > 0, 
0, otherwise 

in the case of second class, with Ao, Ai > and Ai > Ao- 
Suppose we observe at first X\. 
We shall calculate 



z\ = In „,„ ;— r = In 



Ai-e" AlXl . A i 



/(Xi; A ) A -e- A ° Xl A 

In our case we define the sequential probability ratio test at the first 
stage as: 

i) accept Ho if z\ < B, namely 

ln^-X l (X 1 -X )<B 
Ao 



or 



(Ao-Ax)" 1 



ii) reject Hq daca z\ > A, namely 



Ai 



B + In 



<X, 



ln^-Xi(Ai-Ao)>A 

Ao 



or 



Xi < (A - Ax)" 1 



A + ln 
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iii) continue sampling with X2 if B < z\ < A, therefore 



B<ln^-X 1 (X 1 -X ) <A 

Ao 

or 



(Ao - Ai) 



-1 



B + In 



<X±< (A - A] 



1-1 



A + ln 



The likelihood ratio at the n stage is 



Z n = y^Zi = nln- (Ai - A ) V] x % 

i=i i=i 

and the sequential probability ratio test becomes: 
i) accept Hq if 



(3) 



(Ao - Ai) 



-1 



B + nln 



Ai 



< 



E* 



i=i 



ii) reject Hq if 



^^^(Ao-Ai)- 1 



i=i 



^4 + nln 



Ao 



iii) continue sampling with X n+ \ if 

Ao~ 



(Ao-Ai)- 1 



B + nln 



Ai 



<£*i< (Ao-Ai)" 1 



i=l 



A + nln 



Ai 



Theorem 1 T/ie selection mean volume of sequential probability ratio test 
is given by the formula 



B(l-a) + Aa B(3 + A{l-(3) 

E[n\=po — ; j- ^+Pi 



*%- xJ" 1 



l *t + fe-1 



(4) 



Po a^ Pi are t/ie probabilities that the hypothesis Ho and respectively 
Hi 6e trues; 
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• A and B are the stopping bounds for the sequential probability ratio 
test ; 

• a and (3 are the error probabilities of first and second kind. 

Proof: 

We need the following three stages in order to determine the selection mean 
volume: 

1. Calculate the conditional mean of z\ given Aj, i = 0, 1: £Ajzi]. 

2. Calculate the conditional mean of n given Aj, i = 0, 1: E^A n y 

3. Calculate i£[n]- the selection mean volume of the sequential proba- 
bility ratio test. 



Stage 1. We shall have 



E\ a [z\] = E\ 
therefore 



A, 



A, 



o 



Ai 



In -1 - Xi(Ai - A ) = In -i - (Ai - A )£a [*i]; 



o 



Ai Ai — Ao i-^i /« 
m 

Similarly, we obtain 



*.M=^-^=fc£- £-i). <*) 



^w-^-^r 2 --^^- 1 !- (6) 



Stage 2. Because of fact that 



we have 



and 



E Xi [n}= , * = 0,1 (7) 



„ r . B(l — a) + ^4a 

^AoN = — ^ 7^ v (8) 



!?/? + .4(1-/3) 

^AiN = T" ^- (9) 

ln &+ (£-1 
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Stage 3. We know that we can determine the selection mean volume 
of sequential probability ratio test using the formula 



E[n] = p E Xo [n\ + piE Xl [n\; 
therefore one deduces that 



B(l-a) + Aa B0 + A(l-0) 

E[n\=po — ; —^ ^+Pi 



In 



Ao 



S- 1 



ln i^ + ft-1 



(10) 



(11) 



Theorem 2 The signal to noise ratio which defines a measure of classes 
separability in the case of pattern with exponential distribution, for n sam- 
plings has the expression 



d 2 



A A 1 



In 



Ai _ Aj — Ap 
Ao ~^T~ 



Proof: 

From the relation (3) we can note that 

1. the conditional mean of Z n given Aq is 



E Xo [Z n ] = nE Xo [zi] = n 
2. the conditional mean of Z n given Ai is 



Ao \Xq 



E Xl [Z n ] = nE Xl [zi] = n 
3. the conditional variance of Z n given Aq is 



-MS- 1 



Yar Xo [Z n ] = n ■ VarAj^i] = n 
We know [7] that 



In 



Ai Ai — Aq 



Ao X 2 



d: 



2 _ { E Xi[ Z n} ~ E Xo [Z n ]Y 

Var A() [Z n ] 



(12) 



(13) 



(14) 



(15) 



(16) 
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Substituting (13), (14) and (15) into (16) we deduce: 

n 2 T 






n 



In 



Ai _ Ai— Ap 

Ao ~W 



where 
T 



ln ^ + U _1 



-MS- 1 



In 



Ai 



(17) 



ln^i-^-1 



-2 In 



Ao \Ao 

2 



Ai 



Ai 

Ao 

Ao 



+ 



Ao 



vi 



"% W^hM^ 1 









+ 2(^-1 



Ao ) Aq \Ao 



H + (^ 



Ao 
Ai 



Ai 

2 
1) + 



1 + 



Ai 

Ao 



"♦'xt 1 



i^" 2 



A — 2AoAi + A x 
AqAi 



2\2 



therefore 



r _ (Ao-A 1 ) 4 

X 2 \2 
A A 1 

Substituting (18) into (17) we obtain 

,(Ao-Ai) 4 



n- 



d' 1 



A A 1 



]„ Ai Ai— Aq 



Ag 



(18) 



(19) 



3 Conclusions 

• In this paper, we describe in detail the sequential probability ratio 
test in the case of patterns with an exponential distribution. 



We construct two theorems and we proof each of them. 
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• Theorem 1 supplies the selection mean volume of the sequential prob- 
ability ratio test for our considering case. 

• Theorem 2 calculates the signal to noise ratio in the case of pattern 
with exponential distribution, for n samplings. 

• The formula (19) indicates that the more samplings the better sepa- 
ration will be. 
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ABSTRACT. Using methods of geometric function theory, we get new in- 
equalities for polynomials with majorants dependent on Chebyshev polynomials 
of the first kind. These theorems refine some known results for algebraic poly- 
nomials with constraints on the the interval. 

Keywords: Bcrnstcin-typc inequality Chebyshev polynomials, geometric 
function theory. 

§1. INTRODUCTION 

Many inequalities for polynomials with different curved majorants on subsets 
of complex plane are available today ([l]-[6]). For example, Lachance studied 
polynomials with the restriction (1— x 2 )z |p(x)| < Ion the interval [— 1, 1] where 
A is a fixed positive integer [5]. In this paper we consider polynomials with 
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majorants dependent on Chebyshev polynomials of the first kind. Some special 
cases of our statements were proved earlier in [7] , [8] . To obtain estimates of 
coefficients, covering theorems and Bernstein-type inequalities for polynomials 
we use an approach proposed by Dubinin V.N. in [6]. This approach consists 
in constructing an analytic function associated with the given polynomial and 
applying some methods of geometric function theory to this function. Extremal 
polynomials in our statements are superpositions of Chebyshev polynomials of 
different kinds. 

Following [9], we introduce the functions 

Co v {z) = \{h 2v {z) + hr 2v {z)), 

S\ u {z) = \{h 2v {z)-h-^{z)), 

where h(z) = \J(z + l)/2 + \J(z — l)/2, and v is integer or half- integer. Cheby- 
shev polynomials of the first, second, third and fourth kinds are polynomials 

T n (z) = Co„(z), 



U n (z) = Si„+i(z)/V2 2 - 1, 
V n (z) = Co n+1/2 (z)/V(z + l)/2, 



W n (z) = Si n+1/2 (z)/y/(z-l)/2, 
n= 1,2, .. . correspondingly. The following representations are well known [9]: 

T n (z) = \{{z + Vz 2 - 1)" + (z- V? 2 - 1)"), (1) 



Uniz)= (z + V^i)^-^V^i)^\ (2) 



2y/z 2 - 1 


(z+V / z 2 -l)"+5+(z- 


- \/z 2 - 


- l)"+5 


( Z + yjz 2 - 1)5 +{Z- 


-Vz 2 - 


-l)i ' 


{z+^Jz 2 -l) n+ ^ -{z- 


- v z 2 - 


- l)"+5 



v n (z) = ^v<-^_'j r~ v A^ 1 . ( 3 ) 



W n{z) = ^H V- h ■ (4) 

(z + Vz 2 - l) 5 - (z - Vz 2 - l) 5 



272 KALMYKOV: INEQUALITIES FOR POLYNOMIALS 



In the first section of this paper we give some necessary information from the 
theory of bounded univalent functions and a modified form of sufficient condition 
of uni valence from the article [6]. In the subsequent sections we obtain exact 
estimates of coefficients, covering theorems and inequalities for derivatives for 
polynomials of the form 

P n (z) = c n z n + . . . + cq, c„^0, C|€l, l = 0,l,...,n, n > 1, (5) 

which satisfy one of the following conditions for some positive integer k: 

\P(z)\<l/yJl-T*(z), z£[-l,l], (6) 

\P(z)\< y/2/(l + T k (z)), ze[-l,l], (7) 

\P(z)\< y/2/(l -T k (z)), z £[-!,!}. (8) 

Let VUn.k, Wn.k, PyVn.k denote classes of polynomials (5) satisfying con- 
ditions (6), (7) and (8), respectively. 

§2. AUXILIARY STATEMENTS 

Let a function w — f(z) be regular and univalent in the unit disk U — {z : 
\z\ < 1}, and let the following conditions be fulfilled: /(0) = and \f(z)\ < 1 
for \z\ < 1. Denote the class of such functions by B. A function f £ B has the 
expansion 

W = f(z) = OL\Z + OL^Z + (X3Z + . . . . 

The following inequalities for functions from B are well known ([6], [10]): 

|a 2 /ai|<2(l-|ai|), (9) 



2 



1 + |/(^)| . 
1 + 1*1 



m 

a\z 



2 



< ( 1 1 [ f ^ ) lor all : EU. (10) 



\f'(z)\>^i/Wl if |/(«)| = N = 1, (ii) 
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and the derivative f'(z) exists at the boundary point. The equality in (9)-(ll) 
occurs for the function /(z) = a.\z with |ai| = 1. 

We will use these estimates for functions f £ B associated with the given 
polynomial P(z). We need the following lemma 

Lemma 1. Let a function z — F(w) be continuous on an open set B, 
G B C C w , regular on the set B\{w : |-F(w)| = 1} which is a family of 
domains T>. Suppose that 



lim |F(u;)| > 1 for w a G dB. 

W — >Wq 

wGB 

Further assume that F'(0) ^ and that F(w) = if and only if w = 0. 

// G D G T> then the function F(w) effects conformal and univa- 
lent mapping of the domain D onto the unit disk U . If ^ D G T> then 
F(D)c{z:\z\>l}. 

Denote by 

C = $(w) = u) — \JljJ 2 - 1 

we denote the branch of the analytic function (the inverse of the Zhukovskii 
mapping) which maps the exterior of the interval [—1,1] onto the unit disk 
|C| < 1 conformally and univalcntly. 

Lemma 2. If a polynomial P{z) G VU n ^ then the function 

satisfies the conditions of lemma 1 on the open set 



z = Fi(w) := w 



1— n— k 



$ 



iWU w+ ^ 



Bi 



w : \w\ < 1, - | ir" 



w K I \ 2 \ w 



^[-1.1] • 



Lemma 3. If a polynomial P{z) G W n then the function 



F 2 (w) := w 



l-2n-k 



$ 



2 \ w 



k)P (\ ' "- 



satisfies the conditions of lemma 1 on the open set 



Bo 



w:\w\<l,-\w 

2 \ w 



V) p U"'- 



^[-1,1] 
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Lemma 4. If a polynomial P(z) £ VW n then the function 
z = F 3 (w) := w 1 " 2 ™-^ 



+-mx^ 



satisfies the conditions of lemma 1 on the open set 

B,:=L:\v,\<l, l -(^-^)pn(rf + ^))t[-l t l] 



It follows from the representation of the Chcbyshev polynomials (l)-(4) that 
F^w) = iw for P(z) = U m (T k (z)), F 2 (w) = w for P(z) = V m (T k (z)) and 
F 3 (z) = iw for P(z) = W m (T k (z)). 

§3. ESTIMATES OF COEFFICIENTS AND COVERING THEOREMS 

Every inequality for coefficients of functions from the class B implies some 
inequality for algebraic polynomials. 

Theorem 1. Suppose that P(z) G VlA n ,k- Then the estimate 

|c„-i| + |c„|<2" 

holds. For n divisible by k this inequality becomes equality for the polynomial 
U„/k(T k (z)). 

Proof. Consider the function Fi(w) from lemma 2. In some neighborhood 
of the origin we have 

FlO) = /?iio + f3 2 w 2 + /3 3 w 3 + [3 iW 4 + .... 

It follows from lemma 1 that there exists a function 



IV 



2 , „ „3 



= f(z) = U\Z + a 2 z + a 3 z 



such that / £ B and F\(f(z)) = z in the unit disk U. 
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Wc have the following relations 



Ql 



1 



a-i 






(12) 



On the other hand from the definition of the function F\{w) we get 



j3\w 



n+k 



-02W 



n+k+1 



■few 



n+k+2 



$ 



= $ 



1 



2 

Cn-l 



w 



p I - I " 



1 



On „,,n-\-k On—I ~.,n+/c— 1 

in some neighborhood of the origin. 
Hence 

02 



2 n p 2 Xn-l 

Cn 0{ 2" 



>n-l ' 



It follows from (12), (9) that the claimed inequality holds. If n is divisible by 
k and P(z) — U n i k (T k (z)) than F(w) = iw. Therefore this inequality becomes 
equality. The theorem is proved. 

The inequality from theorem 1 implies the estimate 

\c n \ < 2", 

which is equivalent to the property of the polynomial U n / k (Tk(z))2~ n to have 
the least deviation from zero on the interval [—1,1] with weight ^JY — TY(z) 
among polynomials P(z) — z n + .... 

Theorem 2. Suppose that P(z) E W n k (PW^t). Then the estimate 

\Cn\ < 2" 

holds. For n divisible by k this inequality becomes equality for the polynomial 
V n /k(T k (z)) (respectively W n / k (T k (z))). 

To prove this theorem it is sufficient to consider the functions F 2 (w) and 
F^(w) from lemmas 3 and 4 in place of Fi(w). 

Theorem 3. If P(z) € VU n ^ then for any number r, r > 1 the image of 
the ellipse \z — 1| + \z + 1| = r + l/r under the mapping u = yl — T k 2 (z)P(z) is 
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a curve lying inside the ellipse with foci ±1 and the major axis xo. r r ™ +1 + 
r n+ /xo, r < r~ n ~ k + r n+k , where xo, r is the root of the equation 

|c„|(l-r) 2 .x = r2"(l-x) 2 , (13) 

belonging to the interval 1/r < x < 1. 
For 

r>r x = 2\-l + 2y/\(\-l), A = 2"/|c„|, 

i/ie image of the ellipse \z — 1| + \z + 1| = r + 1/r under the mapping 
lo = yj\ — T 2 (z)P(z) is a curve lying outside the ellipse |w — 1| + |w + 1| = 



Xi, r r ™ + + r n+ jx\ %r , where x\ iT is the root of the equati 



ion 



|c„|(l + r) 2 .x = r2"(l + x) 2 , (14) 

belonging to the interval 1/r < X < 1. 

7/ n divisible by k and P{z) = U n /k(Tk(z)) then XQ t , r — xi yT = 1/r anc ^ 
the image of the ellipse \z — 1| + \z + 1| = r + 1/r under the mapping u) = 
■y/l — T 2 (z)P(z) is the ellipse with foci ±1 and i/ie major axis r~ n ~ k + r n+k . 

Proof. Consider the function 

z = F(w) = f3 lW + (3 2 w 2 + ... 

from lemma 2. It follows from the proof of theorem 1 that f3\ = i2 n /c n . Let 
r > 1 and w be an arbitrary point on the circle \w\ = 1/r. 
If \F(w)\ > 1 then 

> | w |"+ fc - 1 = (l/r) n+k -\ (15) 

Suppose that |.F(w;)| < 1. By lemmas 1 and 2 the point w is the image of 
some point z G U under the inverse mapping w = F^~ (z) G B. Using (10) and 
theorem 1, we get 

M 2" 1/r 1/r 

>^7^T7Z^>7^T7I^- (16) 





i 


(w k - 


1 x 


\ „ 


(\ 


( 1 


<l> 


- 




^ 


~ 


w+ — 




|_2 


V 


W k j 


/ 


^ 


\ w 



(1-N) 2 - |c„|(l-(l/r))2-(l-(l/r))2 
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The function f(z) = z/{\ — z) 2 is strictly increasing in [0, 1). Hence inequal- 
ities (16) are equivalent to 

\z\ > x 0jr > 1/r, 

where xo, r is the root of the equation (13), 1/r < x$ tr < 1. Therefore 



$ 



if w *-\\p I L l 



,y , r ,y- Vn - „.M > ^(l/r)^- 1 > (l/r)»+*. (17) 

Comparing (15) with (17), we get inequality (17) for all w on the circle 

\w\ = 1/r. Hence the point A I w k ^ ) P I A I w + ^ ) ) lies inside the ellipse 

with foci ±1 and the major axis xo, r r~ n ~ k+1 + r n+ /xo, r < r~ n ~ k + r n+k . 
In other words the point y\L — T 2 (z)P(z), where z = k I w + A. ) , i.e. \z — 
1| + \z + 1| = r + 1/r, lies inside the ellipse with foci ±1 and the major axis 
X , r r~ n ~ k+1 + r n+k - l / X Q, r < r- n - k + r n+k . 

Now we consider 

r>r A = 2A-l + 2\/A(A-l), A = 2"/|c„|. 

Let w, \w\ = 1/r, r > r A , be a point of the set f(\z\ < 1). We have 
|.FiMI < 2" 1/r < 2" l/r A _ 1 



(1 + |FiH|)2 " \ Cn \ (1 + 1/r) 2 " |c| (1 + l/r A ) 2 4' 
Hence equation (14) has the unique root x\_ r G [1/r, 1) and this root satisfies 
the bound 

l^lMI < Zl,r- (18) 

Now we will demonstrate that any point w, \w\ < l/r A , belongs to the domain 
f(\z\ < 1). Suppose the contrary is true. Then 

r\ < r* = inf{r : r > 1, |i*i(w)| < 1 \/w, \w\ = 1/r}. 

There is a point w* on the circle \w\ — 1/r* such that 

\Fi(w*)\ = 1. (19) 

On the other hand, for any sequence of points W/j,|wfc| < 1/r*, k — 1,2,..., 
converging to w* the bound (18) yields 

|-Fl(tOfc)| <*l,r«, fc = l,2,..., 
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which contradicts condition (19). It follows from (18) that 



$ 



vt-^pdu- ' 



yiU + k— 1 



2 V w fc / V 2 V w. 

It is equivalent to the respective statement of the theorem. 

The last statement of the theorem is immediately obtained from the repre- 
sentations of Chebyshcv polynomials (1), (2). The theorem is proved. 

Proofs of the following two theorems are similar. 

Theorem 4. If P(z) G fV n ,t then for any number r, r > 1 the im- 
age of the ellipse \z — 1| + \z + 1| = r 2 + 1/r 2 under the mapping u) = 
y(l + Tk(z))/2P(z) is a curve lying inside the ellipse with foci ±1 and the 
major axis x r~ 2n ~ k+1 + r 2n+k ~ 1 /x < r~ 2n ~ k + r 2n+k , where x , r is the root 
of the equation (13) belonging to the interval 1/r < x < 1. 

For 

r>r x = 2X-l + 2y/\(X-l), \ = 2 n /\c n \, 

the image of the ellipse \z — 1| + \z + 1| = r 2 + 1/r 2 under the mapping lo = 



y(l + Tk(z))/2P(z), is a curve lying outside the ellipse \u) — 1| + \u> + 1| = 
a;i :r r~ 2 ™~ fe+1 +r 2 " + ' £ ~ 1 /:z!i. r , where X\, r is the root of the equation (14) belonging 
the interval 1/r < x < 1. 

// n divisible by k and P{z) = V n /k(Ti~(z)) then xo_ r = x\. r = 1/r and 
the image of the ellipse \z — 1| + \z + 1| = r 2 + 1/r 2 under the mapping lo = 
-\/(l + Tk(z)) /2P(z) is the ellipse with foci ±1 and i/ie major axis r~ 2n ~ k + 



r 2n+k 



Theorem 5. // P{z) G VW n .k then for any number r, r > 1 i/ie im- 
age of the ellipse \z — 1| + \z + 1| = r 2 + 1/r 2 under the mapping lo = 
■\/(l — Tfc(z))/2P(z) is a curve lying inside the ellipse with foci ±1 and i/ie 
major axis xor~ 2n ~ k+1 + r 2n+k ~ 1 /xo < r ~ 2n ~ k + r 2n+k , where xo, r is the root 
of the equation (13) belonging to the interval 1/r < x < 1. 

For 

r>r A = 2A-l + 2v/A(A-l), A = 2 n /|c„|, 
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under the mapping u> = y(l — Tk(z))/2P(z), the image of the ellipse \z—l\ + \z+ 
1| = r+l/r is a curve lying outside the ellipse \oj — 1\ + \oj + 1\ = xi. r r~ 2n ~ k+1 + 
r 2n+ /xi r , where x\ r is a root of the equation (14) from the interval 1/r < 
ar < 1. 

// n divisible by k and P(z) = W n /fc(Tk(;?)) i/ien rro.r = a?i,r — V r anc ^ 
i/ie image of the ellipse \z — 1| + \z + 1| = r 2 + 1/r 2 under the mapping lu = 
\/(l — Tf.(z)) /2P(z) is the ellipse with foci ±1 and the major axis r~ 2n ~ k + 

r 2n+k 

§4. INEQUALITIES FOR DERIVATIVE OF A POLYNOMIAL 

Theorem 6. If P(z) e VU n M then 

k\P{x)T k {x)T' k {x) - P'(x)(l - T 2 k {x))\ < 

<(n + k-l + y/fcj¥\)\r k (x)\fl - (1 - T*(x))P*(x), xe[-l,l]. (20) 

If n divisible by k and P(z) = U n /k{T k (z)) then this inequality becomes 
equality for all x G [—1,1]. 

Proof. Consider the function F\(w) from lemma 2 on the set B\. The set 
B>i\{w : \F 1 (w)\ = 1} is a family of domains. Denote this family by T>i. It 
follows from lemma 1 that if D G T>\ and ^ D then F\{D) C {z : \z\ > 1}. 
If G D G T>\, then z = Fi(w) maps the domain D on the unit disk U 
conformally and univalently Suppose that a point x G [—1,1] satisfies the 
following condition: if (w + l/w)/2 = x, \w\ = 1 then wis a point of regularity 
of the function Fi(w). (All points x of the integral [-1,1] satisfy this condition 
possibly except for finite number of points). If w G dD,D G T>\ and ^ D, 
then at the point w we have 

PI < 0. (21) 

If «; G dD, G £> G 2?i, then it follows from (11) that 



^ < JM. (22) 
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Comparing (21) with (22) we get the inequality (22) for all regular points 
wedD, D £ X>i. 

If \w\ = 1 then the point 



uj := - \w — 

2 y »- fc 



belongs to [-1,1]. Hence 

l*'MI 



i± 



vT 



Vi -to 2 ' 



A direct computation shows that 



8\F 



d\w 



ii = 1-n-k 



VI -LU 2 



l("'*+i)'* ! " - 



+ l('"'-i) P ' W ("'^)- 

If x = -ij (w + l/u>) then from the representation of Chebyshev polynomial we 



get 
1 



'«;" 



Tk{x), 



k -^) = y/ 1 -'W x )> T ^ 



k \w 



( w -w) 



Using the last equalities we obtain the inequality (20). 

If n is divisible by k and P(z) = U n / k {T k (z)) than F(w) = iw. Therefore 
this inequality becomes the equality. The theorem is proved. 

Theorem 7. If P(z) £ VV n . k then 

y/l-x*\2P'(x)(l + T k {x)) + T k (x)P(x)\ < 



<(2n+k-l + ^1^/2-1)^(1 + T k {x)){2 - (1 + T k (x)) P 2 (x)) , x £ [-1, 1]. 

// n divisible by k and P{z) — V n / k {T k (z)) then this inequality becomes 
equality for all x £ [—1,1]. 

Proof. Consider the function F?,(w) from lemma 3 on the set B 2 - Let the 
set B 2 \\w : \F 2 {w)\ — 1} is a family of domains. Denote this family by T) 2 . It 
follows from lemma 1 that if D £ V 2 and <£ D then F 2 (D) C {z : \z\ > 1}. 
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If G D G T>2, then z = F 2 (w) maps the domain D on the unit disk U 
conformally and univalently. Suppose that a point x G [— 1,1] satisfies the 
following condition: if (w 2 + l/w 2 )/2 = x, \w\ = 1 then w is a point of regularity 
of the function F 2 (w). (All points x of the integral [-1,1] satisfy this condition 
possibly except for finite number of points). If w G dD,D G T> 2 and ^ D, 
then at the point w we have 



d\F 2 
d\w\ 



<0. 



If w G 5L>, G D G 2? 2 , then it follows from (11) that 

d\F 2 \ 



a\% 



< 



\Cn\ 

2™ 



(23) 



(24) 



Comparing (23) with (24) we get the inequality (24) for all regular points 

we 8D,D eV 2 . 

If \w\ = 1 then the point 



1 



1 



P 



1 



belongs to [-1,1]. Hence 

l*'MI = 



i± 



s/T^J 1 



Vi-lu 2 ' 



A direct computation shows that 



d\F 2 
d\w\ 



= 1 - 2n - k + 



VT 



(»*-^)p(*)+ 



+ ^[w k + ^)P / (x)(w 2 , 

"lift. / \ 111" 



We have 



Therefore 



1 



2 V w^ 



1 



W = ^ K + 



,2A' 



- U K ± 



Tl. 



Hence 



w K ± 



y/2y/T k (x)±\, T' k {x) 



k l„.,2k 



,,-!• 
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Using the last equalities we get inequality from the statement of the theorem. 
If n is divisible by k and P(z) — V n / k (T k (z)) than F(w) = w. Therefore 
these inequalities become equalities. The theorem is proved. 

Theorem 8. // P(z) e V\V n , k then 

Vl-x2\2P'(x)(T k (x) - l)+T' k {x)P{x)\ < 

< (2n+ k - 1 + ^\c n /2"\)^(l - T k {x)){2 + (T k (x) - l)P 2 (x)), jg[-l,l]. 

For n divisible by k this inequality becomes equality for the polynomial 
W n/k {T k {z)) 

Remark 1. If we assume k = 1 in our statements then we obtain some 
results from articles [7], [8]. 

Remark 2. If we assume k = 2 in the theorems 2, 4, 7 then we get 
statements for polynomials with restriction |P(x)||ie| < 1 on the interval [—1, 1]. 
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Abstract. We consider so called jump problem, i.e. the problem on evalu- 
ation of holomorphic in C\T function with given difference of its boundary 
values on curve I\ If this curve is rectifiable, then solution of the problem 
is representablc as the Cauchy integral over I\ The present paper is dealing 
with the case of non-rectifiable path I\ Generally speaking, in this situation 
the integral over T does not exist, but the problem can be solved in terms of 
polygonal approximation of this curve. 

Key words and phrases: holomorphic function, jump problem, non-rectifiable 
curve, approximation, fractal. 

1. Introduction 

We consider the following boundary value problem for holomorphic functions. 
Let r be a closed Jordan curve on the complex plane C bounding finite domain D + , 
and D~ = C\ D + . A function /(£) is defined on I\ We seek a holomorphic in C \ V 
function <&(z) such that $(oo) = 0, the boundary values \im D + Bz ^ t &(z) = < & + (t) 
and lim D - Bz ^ t $>(z) = &~(i) exist for any t € T, and 

$+(t)-*-(t) = /(t),t€r. (l) 

The problem is called jump problem. It is well known and has numerous appli- 
cations in elasticity theory and so on (see, for instance, [12, 4]). If the curve T is 
piecewise-smooth and the jump f(t) satisfies the Holder condition 

^P{ m ^~l { f )l ■■ f,*" G r,*V *"} = K(f,T) < oo (2) 

with exponent v G (0, 1], then unique solution of this problem is the Cauchy integral 

Below we denote -ffjy(r) the set of all functions satisfying (2). 

If the curve T is rectifiable but non-smooth, then the Cauchy integral (3) has 
boundary values <& ± if / satisfies the Holder condition with exponent v > \ (see 
[2, 13]). It is not difficult to show that this boundary values satisfy the equality 
(1), i.e. the jump problem on non-smooth rectifiable curve is solvable if the Holder 
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exponent of the jump exceeds ^ ■ As shown in [2] , this result is cannot be improved 
in the whole class of rectifiable curves. 

Now let r be non-rectifiablc. Then the jump problem (1) keeps the sense, but 
the Cauchy integral over that curve loses definiteness. 

The author researched the jump problem for non-rectifiable curves (see, for in- 
stance, [5, 6, 7]) and obtained certain conditions of its solvability in terms of various 
fractal dimensions of curve I\ In particular, he proved [6] that the jump problem 
on non-rectifiablc curve is solvable under assumption 

Dmr ^ 

Here DrnT means upper metric dimension of the set T, i.e. 

n r r lo & N ( £ ) 
Dm 1 = lim sup , 

£ ^o - log £ 

where N{e) stands for the least number of disks of diameter e necessary for covering 
of the set V (see [10, 3]). The bound (4) cannot be improved on the class of curves 
of fixed upper metric dimension. Note that in general the solution of problem (1) 
on non-rectifiable curve cannot be represented by the Cauchy integral. 

In the present paper we study the jump problem by means of polygonal approx- 
imations of non-rectifiable curve T. The scheme of that approach is rather easy. 
Let Ti, T2, . . . , r„, . . . be a sequence of polygons approximating T in some sense. 
We extend the jump / from T onto the whole complex plane and consider the limit 
of Cauchy integrals 

..(,) = to ± [ ^ (5) 

n-+oo 2m J T Q — z 

where f £ is extension of the jump /. The integrals are defined here because the 
polygons r„ are piecewise-smooth. If the limit exists and has boundary values <f> f 
satisfying (1), then it is a solution of the jump problem. This approach enables us 
to obtain an approximate solution of the jump problem with a bound for its error. 
A non-rectifiable curve can be approximated by polygons in various ways. In 
the next section we are dealing with so called monotone approximations. Then we 
consider rcprcsentability of the solution (5) by the Cauchy integral (Section 3) and 
certain examples and commentaries (Section 4). 

2. Monotone Polygonal Approximations 

We use the following notation. The width w(S) of finite domain 5 is diameter 
of the most open disk lying in 6. The Hausdorff distance between sets A and B is 
defined by equality 

dist#(A B) := sup{dist(z, B) : z e A} + sup{dist(z, A) : z e B}. 

If 7 is rectifiable curve, then \j\ stands for its length. 

Definition 1. We call a sequence of closed polygonal lines Ti,r2, . . . ,T„, . . . in- 
creasing (or decreasing) approximation of T if these lines bound finite domains 
D^ , D^ , . . . , D+, . . . such that D+ C D^ +1 C D + (correspondingly , D+ D D^ +1 D 
D + ) for any positive integer n and linin^oo dist#(r n , T) = 0. 
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We call a sequence I^, T%, . . . , T n , . . . monotone approximation of T if it is cither 
increasing or decreasing approximation of this curve. If a sequence of polygons 
{r„} approximates a non-rectifiable curve, then |r„| — > oo. 

Let a sequence {r„} be monotone polygonal approximation of T. We put A„ = 
Dn+i \ Dn if this approximation increases, and A„ = £>+ \ D^ +1 if it decreases, 

n = 1, 2, As D+ and D^ +1 are polygonal domains, since A„ is either polygonal 

domain (maybe, double connected) or union of finite number of disjoint polygonal 
domains. Let A„ stand for sum of perimeters of all connected components of A„, 
and tv n for maximal width of these components. 

In what follows we use also the Whitney extension operator (see, for instance, 
[14]). If a function f(t) is defined on a compact set K C C and satisfies the Holder 
condition with exponent v, i.e. / G H U (K), then its Whitney extension f w (z) is 
defined on the whole complex plane C and satisfies the Holder condition with the 
same exponent v, i.e. f w £ H„(C). Moreover, h v (f w , C) = h v {f,K). In addition, 
f w (x + iy) has partial derivatives of all orders at any point x + iy = zEC\K and 

ivr(z)|<^(/,x)dist"- 1 (z,r). 

Lemma 1. Let 6 be finite domain with Jordan rectifiable boundary 7, / G H u {^), 
and f w is the Whitney extension of the function f from the curve 7. If p < jzr^, 
then 

fl'dxdy < C'<(/,7)|7|a. 1 -'< 1 -"'(«), 



/A 1 *'" 



where C < 4tt(1 - 2p( 1 -")- 1 )- 1 . 

Proof. We consider the Whitney decomposition of domain 5 (see [14]). It consists 
of dyadic squares Q such that a(Q) < dist(z,7) < 4a(Q) for any z € Q, where a(Q) 
is length of side of Q. Let m n stand for number of squares Q such that a(Q) = 2~ n . 
Then the cited above bound for \Vf w (z)\ yields inequality 



\Vf w \ P dxdy<K(fr/) E 2 ~" 



(2-p(l-i/)) 

i'l>r. 



n— — 00 



All m„ squares with side 2~™ are situated in 4 • 2~"— neighborhood of the curve 7. 
The square of the neighborhood does not exceed 47r|7|2 _n . Hence, m n < 47r|7|2™. 
But m n = for w{8) < 2~ n . Whence, 

|V.r| p ^d2/<4^(/,7)|7| J2 2-"( 1 -^)). 

S 2-™<w(6) 

The series converges under assumptions of the lemma and 

pn(p(l-i/) 

2-"<w(S) n=0 

what concludes the proof. 



00 1— p(l — u)(x\ 
V- 2 - n (l-p(l-v)) < w l-p{l-u) {S] V^ 2 n(p(l-u)-l) = w W 



Theorem 1. // non-rectifiable curve T has a monotone polygonal approximation 
{r„} such that 

00 

£ A„u^(i-) < oo (6) 

n=l 
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for certain v G (0, 1) and p > 2, then any function f £ iJ„(r) has an extension f 
such that equality (5) defines a solution of the jump problem (1). 

Proof. We extend the jump / in the following way. First we apply the Whitney 
extension operator and obtain an extended function f w {z). Then we put f2 := 
n ra >ir„, restrict f w on compact f2 and apply again the Whitney operator to this 
restriction. As a result of this double Whitney extension we obtain a function / . 
It has all mentioned above properties of the function f w , but additionally it has 
the following property: 

- restriction of the function f £ on any connected component S of C \ CI equals 
there to the Whitney extension of the restriction of f w on the boundary dS of this 
component. 

This property follows from the construction of the Whitney extension operator 
(see [14]). Below we call it restrictive property. 

Let us consider an increasing polygonal approximation {T„}. Obviously 

$„(*)■- l f f £ (0d(_^ 1 f f £ (C)dC 



27 ™ Jr n C - z j^ Q 2m J dA . ( - z 

where Ao = D^. The first partial derivatives of / are integrable in any connected 
component of Aj,j = 0, 1, . . . ,n, by virtue of the restrictive property and Lemma 
1 . Therefore we can apply the Cauchy-Green formula to each term of the last sum 
and obtain 

^[..,* .,*£,, , 1 ff df £ (()d(d(\ 



Here and in what follows x(A, z) means characteristic function of set A, i.e. x(^ z ) 
1 for z G A and x(A z ) — for z € C \ A. Hence, 



2niJJ D + d( C 

d Lemma 1 
grable in D + . Consequently, the limit (5) exists and 



According to the restrictive property and Lemma 1 the derivative A is inte- 



*,M = ^WM + i// «« P) 



2tti J J D + d( C - z 

The term x(-D ' , z)f (z) has jump / on T. The derivative »4 under assumption 

(6) is integrable in D + with degree p > 2. Consequently, the integral term of (7) is 
continuous in the whole complex plane (see, for instance, [?]). Thus, the theorem 
is proved for increasing approximations. The proof for decreasing approximations 
is the same. 

Obviously, A n < |r„| + |r n +i| and u>„ < dist#(r„,r). Hence, there is valid 

Corollary 1. If non-rectifiable curve T has a monotone polygonal approximation 
{r„} such that 



£)(|r n | + |r n+ i|)dist^- p(1 -' ) (r n ,r) <oo (8) 
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for certain v G (0, 1) and p > 2, then any function f G -ff„(T) has an extension f 
such that equality (5) defines a solution of the jump problem (1). 

Moreover, we can change the sequence {r n } by its subsequence with prescribed 
decrease of dist#(r„,r), for instance, dist#(T„,r) < n~ a . If |r„ < en 13 , then 
the series (8) converges for a(l — p(l — v)) — (3 > 1, and condition p > 2 implies 
v > a+ 2a ■ The left side is less than one if a > [3 + 1. Thus, we obtain 

Corollary 2. Let non-rectifiable curve T have a monotone polygonal approximation 
{r„} such that dist#(r n ,r) < n~ a and \V n \ < en 13 for n = 1,2, . . . . If a > [3 + 1 

and 

a + B+ 1 

»> 7, , 

Za 

then any function f <G -ffiy(r) has an extension f £ such that equality (5) defines a 

solution of the jump problem (1). 

The following corollary can be proved in just the same way. 

Corollary 3. Let non-rectifiable curve T have a monotone polygonal approximation 

{r„} such that dist H (T n ,T) < 2~ n and \T n \ < c ■ 2 an for n = 1, 2, . . . . If a < I 

and 

a + l 

then any function f G H V {T) has an extension f such that equality (5) defines a 
solution of the jump problem (1). 

Now let us consider $„ as approximation of the solution $* and bound the error, 
i.e. the difference $* — $„. Obviously, this difference cannot be small in domain _R+ 
equaling to D + \ Dn for increasing approximation and to D+ \ D + for decreasing 
one. Therefore, the intrinsic definition of the error is 

E n := supd^O) - <& n {z)\ : z G R-}, 
where R~ := C \ Rn ■ According to (7), under assumption of Theorem 1 we have 

where the sign is plus for increasing approximation and minus for decreasing one. 
Hence, 



E n < - 

IT 



1/P / r r \ 1/q 



JhM ic) \ d * d ») {ILi K -' rixdy 



where p _1 + q^ 1 — 1, z = x + iy and z G R~. According to Lemma 1 



df £ 



dxdy<CK(f,T) J2 K^ n - p{1 ~ v \ 

j=n+l 



where C is constant from the lemma. If d, d n and r„ are diameters of T, T n and 
Rn correspondingly, then r n = d for increasing approximation and r„ — d n for 
decreasing one. As q < 2, since 

2tt „ „ 



K-z\-"dxdy< I/ \(-z\-idxdy< 

\C-z\<r n * — Q 
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If the approximation decreases, then d n < 2d for sufficiently large n. Thus, there 
is valid 

Theorem 2. If all assumptions of Theorem 1 are fulfilled, then 

i/p 

E n < Mh v (f,T)d( p -V/ p I J2 X ^1~ P{1 ~ 

\j=n+l 

where M < 2 ( - c '+^^(l - 2p( 1 -")- 1 )- 1 /p(2 - q)- 1 ^. If the approximation increases, 
then this bound fulfils for any n, and for decreasing approximation it is valid for all 
n such that d n < 2d. 

This theorem means that under assumptions of Corollary 2 we have E n = 
0{n~^ +e ), where fi = 2va — (a + [3 + 1), and under assumptions of Corollary 
3 E n — 0(2~( K_£ )"), where k — 2v — (a + 1). Here e stands for arbitrarily small 
positive value. 

3. Inscribed Polygons and Cauchy Integral 

Generally speaking, the limit (5) cannot be represented by the Cauchy integral 
because the integral J r fdz over non-rectifiable path V is not defined. It is of 
interest, that the representability of <&* by the Cauchy integral is related with 
polygons which are inscribed into the curve V . 

We consider all finite sequences t = {ti,t2, ■ ■ ■ ,t n } of points on the curve T 
numbered in order of path-tracing. That sequence determines polygon P T consisting 
of segments [tj,tj+i],j = l,2,...,n (here and below t n+ \ = t\). This polygon is 
inscribed into T; it may be self-intersecting. Let ^f(x) be real continuous increasing 
function defined for x > so that \l/(0) = and lim^^oo ^>(x) = oo. By ip(x) we 
denote its inverse function. We relate with an inscribed polygon P T the following 
sum: 

n 

a 9 {P r ):=J2^{\t j+1 -tj\). 

i=i 

Definition 2. A path T is called ^—rectifiable if there exists a constant C > 
such that a^(P T ) < C for any inscribed polygon P T . 

If *f?(x) = x, then any "J— rcctifiable curve is rectifiablc in usual sense. But if, 
for instance, ^(x) = x p ,p > 1, then ^—rectifiablc curve can be non-rectifiable. 

Let z = £(x) be a mapping of segment / = [0, 1] onto the curve T. Then 
<7*(P r ) := X^7=i ^i\£( x j+i) ~ £( x j)l)> where {xj} is increasing sequence of points 
on /. A function £(x) is called function with bounded tp— variation if the sums 
Xw=i ^(\£,( x j+i) ~~ £,( x j)\) are bounded (see [15, 11]). Hence, V is "f— rcctifiable if 
and only if the mapping £ has bounded , f— variation. This fact enables us to apply 
L.C. Young's theory of Stieltjcs integral (see [15, 11]). As shown in [8], the Cauchy 
integral in the Stielties form 

±- [ f(()dlog(C-z) 
2m J r 

exists by virtue of the Young theorem [15] if / has bounded O— variation and 
T^=i i>{ )&{ ) < °°; nere @ is a function satisfying all our assumption on the 
function "J, and 9 is its inverse function. In particular, if T is "f— rcctifiable and 
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/ e H V (T), then / has bounded ©-variation for 0(x) = *((f ) 1/l/ ), /i = h v {f,T). 
In this case 0(x) = hip v (x), and the Cauchy integral exists in the Stieltjes sense if 

DC 

X ~" - l+ " ' ■ ' <oo. (9) 



>' + "6 



n— 

Moreover, there is valid the following result. 

Lemma 2. 7 [8] Let T be "f — rectifiable curve, where the function ^ is convex and 
satisfies the condition (9). If a function f € H„(T) has an extension F G iJ„(C) 
and F has first first partial derivatives which are locally integrable on the complex 
plane, then there is valid the Cauchy- Green formula 

Our double Whitney extension f £ satisfies assumptions of this lemma. Thus, we 
obtain from the last lemma and equality (7) 

Theorem 3. Let V be a ^—rectifiable curve, where the function W is convex and 
satisfies the condition (9). If it has a monotone polygonal approximation satisfying 
condition (6), then the jump problem (1) has a solution (5) for any f e H V (T), and 
this solution is representable as the Cauchy integral 

$*(*) = ^ [ f(()dlog((-z), 
2m J r 

where integration is understood in the Stieltjes sense. 



4. Examples and Commentary 

4.1. We consider first so called von Koch snowflake. This well known self-similar 
fractal curve bounds domain D + = T LKU^Li U ,-=i T n j), where T is regular 
triangle with unit sides and T n j are similar regular triangles with sides 1/3™. We 

put D^ := T U(Un=i Uj=i T nj), r v := dD%. Obviously, pre-fractals T N forms 
increasing approximation of T. The difference Ajy — Dj +1 \ D^ consists of 3 • A N 
triangles with side 3 _JV_1 . Hence, its perimeter is Ajy = (4/3) N , and its width 
is con = c • 3 _JV ,c = \/3/9. The series (6) converges for v > log 3 2. The fractal 
dimension of the von Koch snowflake equals to log 4 2 = 2 log 3 2. Thus, for this curve 
the condition (6) for solvability of the jump problem coincides with (4). 

4.2. The following example shows that the condition (6) can improve (4). Let 
(3 > 1. We put K n — 2^1, where square brackets mean entire part, and divide 
the segment [2 _n , 2~™ +1 ] of real axis into K n equal parts of length a n — 2~ n /K n 
each one. We denote by x n j the ends of these parts, i.e. x n j = 2~ n + ja n ,j — 
0, 1, ... , 2^ n ^ — 1, and consider vertical segments I n j — [x n j, x n j + i2~ n \. Then we 
fix a decreasing sequence of positive numbers e n such that e n < \ct n ,n = 1,2,..., 
and consider mutually disjoint rectangles 6 n j = {z = x + iy : x n j < x < x n j + 
e„,0 < y < 2~ n }. Let So be square {z = x + iy : < x < 1,0 < y < 1} 

and D + = Sq\ ( l J^L 1 U^ =0 ~ Sn.j), i-c. D + is unit square with countable set of 
rectangular cuts condensing to origin. We denote by T* the boundary of domain 
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D + . It is non-rectifiablc. As shown in [5, 6], 

2/3 
^^ (10) 

in the case e n = \a n . But the considerations of these papers keep correctness for 
e n < 2 a n, too. Thus, the equality (10) is valid under assumptions of the present 
example. 

The polygons T* N = dD+, D+ = S \ (U^ =1 ufl q'" 1 5 n j), TV = 1, 2, ... , form 

decreasing approximation of T* . The difference Ajy = U ? -_ ~~ # n ,j here is union of 
finite family of rectangles, and values of its perimeter and width are evident. For 
instance, if e„ — 0(exp (— a" 1 )), then the series (6) converges for v > \, what is 
essentially better than v > Bm 2 r . 

4.3. A polygonal approximation can be constructed in terms of the Schauder se- 
ries. Let z = £(x) be a continuous mapping of segment / = [0, 1] onto the curve 
r.We extend £(x) into the Schauder series £(x) — X)i=o a i^i( x )' wnere ^j(x) are 
the Schauder functions. These functions arc pieccwisc-lincar (see, for instance, [1]). 
Hence, a partial sum £ n (x) = Y2i=o a J^j( x ) ma P s I onto a polygon T n . But the 
Shaudcr scries uniformly converges to £(x) (see [1]). Thus, the sequence r 1 ,r 2 , . . . 
approximates the curve T (generally speaking, non-monotone one). Certain solv- 
ability conditions for the jump problem in terms of the Schauder coefficients are 
obtained in [9]. 

The research is supported by Russian Foundation for Basic Researches, grant 
07-01-00166-a. 
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1 The statement of the problem. 

In the domain O = {(x, t) : < x < p, < t < T} let us consider the equation 

Uxxxx +C DQ t U = / (x, t) (1) 

where 1 < a < 2, c-^ot is Caputo fractional the ath derivative operator with 
respect to t [1],[4]. 

If a = 2 the equation (1) changes to the well-known equation of transverse 
vibration of elastic rods 

Uxxxx +U tt = f(x,t), 

And if a — 1 the equation (1) changes into 2parabolic equation 

Uxxxx i U,f J ^X, X ) . 

Caputo fractional derivative operator expresses as Riemann-Liouvillc fractional 
integral [1] 

Problem. In the domain ft — {(x,t) : < x < p, < t < T} find u(x, t) that is 
the solution of the equation (1) and is satisfying the following conditions 

d 2m u(0,t) d 2m u( P ,t) 

dx 2m = dx 2m = 0, TU = 0, 1, < t < T, (3) 
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u(x, 0) = <p(x), u t (x, 0) = ip(x), < x < p. (4) 

Theorem. If f(x,t) G C^p], f(0,t) = f(p,t) = 0,^^ G L»p^[0,p] is 
uniformly in t, < 7 < 1. ip (x) , ip (x) £ C[0,p],<£>' (a;) , ^ (a;) are piecewise 
continuous on [0,p], <p (0) = ip (p) = 0,ip (0) = ip (p) = 0, then there is a solution 
of the problem in class C x ' t (H) n C x ' t (O). 

Proof. We research the regular solution of the problem in the form of 

00 
u(x,t) = ^2u n (t)X n (x), (5) 

71=1 

here _ 

/2 717T 

- sin X n x, A n = — ,n e N. 
P P 

Substitute (5) into the equation (1) we get 

c Dl t u n {i) + \ i n u n {t) = f n {t) (6) 

where 

p 

/„(*) = f f(x,t)X n (x)dx 



Conditions (4) change to the following conditions 

u„(0) =<p n ,u' n (0) =tp n 

here 

p p 

<p n = ip(x)X n (x) dx, ip n — / ip(x)X n (x) dx. 


Using (2) we reduce equation (6) to the form 

iZ; t a u n (t) + \ A n u n {t) = f n {t) 

Acting to both sides of this equation by the operator I§ t we get Volterra integral 
equation of the second kind 

,4 * 



K_ 1 (i _ ^o 

I» 



U n (*) + F73 / (* " T T U « ( T ) dT = Vn + *l>n + JSt/n (t) 





Using the successive approximation method we find the unique solution 

t 
u n (t) = <p n E aA (-\ 4 n t a )+t^ n E a , 2 {-\ i n t a )+ j (t - r) a - 1 E a , a (-\ 4 n (t - r) a ) /„ (r) dr 



294 AMANOV-KUZIBAEV:BVP-FRACTIONAL DERIVATIVES 



here 

oo 

E a , [3 (z) = J2 



is Mittag-Leffler function [3]. 

Then the solution of the problem rewrites in the form of 

oo 

u{x,t) =EX n (x) [<p n E ail (-\it a )+t^ n E aa (-X 4 n t a )] + 



71=1 

OO t 



+ E Xn (x) f(t- T) a - 1 E a , a (-A* (i - r) a ) /„ (r) dr 

n=l 

It is easy to check that 

oo 

limu(M) = E <P n x n(x) = ¥>(z), 

*^° n=l 

oo 

H%= E*P n Xn(x) = lP(x). 

*^ u n=l 

If we show the uniform convergence of the series 

oo 

E X^ n X n (x) [p n E atl {-Xit a )+t^ n E at2 (-X 4 n t a )] + 



(7) 



oc 



n=l 

t 

\4 



+ £ A„*» (x) J(t- rf- 1 E a . a (-A* (t r) a ) f n (r) dr 

n=l 



(8) 



then the series (7) and 

CO CO 

C D« t U = £ /n(t)*n(*) - E K>Xn (x) [p n E a>1 (-<*<*) + t^E^ (-A^)] ■ 



n—1 n— 1 

CO t 

- E A*X„ (*) / (t - r)- 1 E a , a (-A* (t - r) a ) /„ (r) dr 

n=l 

convergence uniformly, too. 

The series (8) we represent as the sum Ii + I 2 + h, where 



(9) 



h= EA 4 „uJ»(^«,i(-^), 

n=l 

CO 

^2 = E X^ n X n (x)tE a . 2 (-x 4 n t a ) , 

n=l 

CO t 

h = E A^Jf n (x) / (t - rf- 1 E a , a (-X 4 n (t - r) a ) /„ (r) dr. 

71=1 

Now we need the following estimate 

M 

\E a A z )\ < y— - r \z\>0,M = const >0 (10) 

1 + \z\ 
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Using (10) we show the convergence of the I\ 

n— 1 n— 1 n 

Let < to < t < T. In terms of the theorem we have 

oo,4|i 00,4 _. oo 

1 7 A 4 , a < Zv T4T7 l^nl = p Zv 1^1 < °° 
n= l i + <V* „=i A n*0 r n=1 

Analogously for I2 we get 

00 M 00 

2 Af, |VJ * |^,2 (-A^ Q ) I < — T E 1^1 < ™- 
n=l „=i 

And now we show the convergence of the/3. It majorizes by the following series 

00 t 

EA*/(*-t) |^a,a(-A*(t-r) a )||/ n (r)|dr< 

71=1 

n=l "0 n n=l "0 n 

00 00 

= CMEiw ln(2A^«) < CM VJ prx (in 2T« + f In A* ) , e > 

n— 1 n n—1 n 

Let < e < <5. Then the above series converges uniformly. 
Theorem is proved. 
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Abstract 

We first explain the research problem of finding the sparse solution 
of underdetermined linear systems with some applications. Then we 
explain three different approaches how to solve the sparse solution: 
the l\ approach, the orthogonal greedy approach, and the £ q approach 
with < q < 1. We mainly survey recent results and present some 
new or simplified proofs. In particular, we give a good reason why 
the orthogonal greedy algorithm converges and why it can be used to 
find the sparse solution. About the restricted isometry property (RIP) 
of matrices, we provide an elementary proof to a known result that 
the probability that the random matrix with iid Gaussian variables 
possesses the PIP is strictly positive. 

1 The Research Problem 

Given a matrix $ of size m x n with m < n, let 

TZ k = {$x,xG R", ||x|| < k} 

be the range of $ of all the k-component vectors, where ||x||o stands for the 
number of the nonzero components of x. 
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Throughout this article, $ is assumed to be of full rank. For a vector 
y G IZk, we solve the following minimization problem 

min{||x|| , xGtMx = y}. (1) 

The solution of the above problem is called the sparse solution of y = $x. 

It is clear that the above problem can be solved in a finite time. Indeed, 
write $ = [0i, 02, ' ' ' > 0n] with 0; being a m x 1 vector. One can choose 
m columns, say A = [0j i; • • • , <f) im ] from $ to form a m x m linear system: 
At, = y. If A is nonsingular, one can find a solution z. By exhausting all 
m x m nonsingular submatrices from $ and solving all such linear system 
of equations, one can see which solution has the smallest number of nonzero 
entries. 

However, there could be C^ such nonsingular linear systems from $x = y 
which need to be solved. For example, a rectangular matrix $ with entries 
(xj) 1 , i = 0, • • • , m, j = 1, • • • , n for distinct real numbers x^s. Any m x m 
sub-matrix from $ is of full rank. The number C^ grows exponentially fast 
as m and n go to oo. For example, when n = 2m, C^ ~ 2 n . A common 
case m = 512 and n = 1024 needs to solve at least 2 512 linear systems of 
size 512 x 512. This is impossible to do within a hour using current available 
computer. That is, the above method to solve Eq. (1) needs non-polynomial 
time. Are there any other methods to solve the above problem? Before we 
answer this question, let us see why we want to solve the problem in the next 
section. 

Remark 1.1 When m — n, $x = y is a standard linear system and the 
solution is unique if $ is of full rank. We have already known the Gaussian 
elimination method can be used to solve such linear systems. 

Remark 1.2 When m > n, one may not be able to have $x = y. Instead, 
one asks to find x which minimizes the quantity ||$x — y||2, where || • H2 is 
the discrete £2 norm. This is a standard least squares problem. When $ is 
not full rank, one usually solves the following minimal norm solution using 
standard least squares methods. That is, find x such that 

min{||x|| 2 , xGS A CM n ,}. (2) 

and 

S A := {x G R n , ||$x - y|| 2 = min ||$z - y|| 2 }. (3) 
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The solution can be found by using the pseudo inverse or using the singular 
value decomposition. 

Remark 1.3 When each column of $ is normalized to be 1, $ is called a 
dictionary. When $$ T = I m with identity matrix I m of size m x m, $ is 
called a tight frame. We shall use these two concepts in later sections. 

2 Why do we find the sparse solutions? 

In this section we give several reasons why we want to solve the sparse solution 
of underdetermined systems of linear equations. 

2.1 Motivation: Signal and Image Compression 

This is the most direct and natural application. Suppose that a signal or an 
image y is represented by using a tight frame $ of size m x n with m < n. 
We look for a sparse approximation x satisfying 

min{||x|| , x G R n , ||$x - y|| < 0}, (4) 

where 6 > is a tolerance. In particular, for lossless compression, i.e., 6 = 0, 
the above (4) is the our research problem (1). 

2.2 Motivation: Compressed Sensing 

We are interested in economically recording information about a vector x 
in R n . First of all, we allocate m nonadaptive questions to ask about x. 
Each question takes the form of a linear functional applied to x. Thus, the 
information we obtain from the questions is given by 

y = $x, 

where $ is a matrix ofmxn. In general, m is much smaller than n since x is a 
compressible data vector. Let A be a decoder that provides an approximation 
x* to x using the information that y holds. That is, Ay = x* ps x. Typically, 
the mapping A is nonlinear. The central question of compressed sensing is 
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to find a good set of questions and a good decoder ($, A) so that we can find 
a good approximation x* of x. See, e.g. [Candes'06]. 

For example, when x G R n with ||x|| < k << n, one wants to know 
which components of x are not zero and what values are. We can design 
some vectors $ to question x by inner product. Thus, we get y = $x G TZk- 
To find x, we solve our research problem (1). 

For another example, suppose that a data vector z is a set of compressible 
data, i.e., there exists a vector x G R n with only k nonzero entries such that 
z = Ax for an invertible matrix A of n x n. Suppose that the questions can 
be represented in the form y = Cz with m x n matrix C. Since z G TZk, i.e., 
z = Ax, we have 

y = CAx. (5) 

Certainly it is necessary to question z m times with m > k. That is, we have 
k < m < n. 

If C is chosen in the form QA~ l for some rectangular matrix $ of size 
mxn, we need to solve the following minimization problem in order to record 
the data z economically. 

y = Cz = fA^Ax = $x. (6) 

The problem is to find the sparse representation x satisfying the above (6) 
which is the same as (1). 

2.3 Motivation: Error Correcting Codes 

Let z be a vector encoded x by a redundant linear system A of size mxn 
with m > n. That is, z = Ax is transmitted through a noisy channel. The 
channel corrupts some random entries of z, resulting a new vector w = z + v. 
Finding the vector v is equivalent to correcting the errors. 

To this end, we extend A to a square matrix B of size m x m by adding 
A -1 , i.e, B = [A-^A 1 -]. Assume that A satisfies A T A = I n , where I n is the 
identity matrix of n. Then we can choose A L such that BB T = I m the 
identity matrix of size m. Clearly, 



B T w = B T z + B T w 



+ 



A T v 

(A^f, 



Let y = (A ± ) T v which is the last m — n entries of 5 T w. Since z is in the 
codeword space V which is a linear span of columns of the matrix A, (A ± ) T v 
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is not in the codeword space and is the only information about v available 
to the receiver. 

If the receiver is able to solve the minimization problem Eq.(l) with 
$ = (A ± ) T . That is, find the sparsest solution v such that y = (A ± ) T v. 
Then we can get the correct x. Thus, this error correcting problem is again 
equivalent to the sparsest solution problem Eq. (1). See [Candes and Tao'05] 
and [Candes, Romberg, Tao'06] for more detail. 

2.4 Motivation: Cryptography 

Although large prime numbers are currently used for secure data transmis- 
sion, it is possible to use underdetermined systems of linear equations instead. 
The ideas can be described as follows. Suppose that we have a class of ma- 
trices $ of size m x n with m < n which admit a computationally efficient 
algorithm for solving the minimization Eq.(l) for any given y which is in 
the range TZk- Let \& be an invertible random matrix of size m x m and 
A = ^$. Suppose that a receiver wants to get a secret data vector x from a 
customer, e.g., a vector consisting of credit card number, expiration date, and 
the name on the credit card. The receiver sends to the customer the matrix 
A in a public channel. After receiving A the customer computes z = Ax and 
sends z to the receiver in a public channel. As we mentioned above, finding 
the sparse solution x from z using matrix A is non-polynomial time. With 
overwhelming probability, such x can not be found by other parties. 

However, the receiver is able to get x by solving y = \I/ _1 z = $x which is 
our research problem Eq. (1). By changing \1/ frequently enough, the receiver 
is able to get the secured data every time while the hacker is impossible to 
decode the data. 

2.5 Motivation: Recovery of Loss Data 

Let z be an image and z be a partial image of z. That is, z loses some 
data to become z. Suppose that we know the location where the data are 
lost. We would like to recover the original image from the partial image 
z. Let $ be a tight wavelet frame such that x = $z is the most sparse 
representation for z. Let \l/ be the residual matrix from $ by dropping off 
the columns corresponding to the unavailable entries, i.e., the missing data 
locations. Note that $ T $ = I m and hence, \l/ r \l/ = 1^ with £ < m. 
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It follows that \& T x = ^/ T $z = z by the ort honor mality of columns of 
$. Thus, we need to find the sparsest solution x from the given z such that 
z = \l/ T x which is exactly the same problem as our research problem(l). Once 
we have x, we can find z which is z = $x. See [Aharon, Elad, Bruckstein'06] 
for numerical experiments. 

3 The £i Approach 

Although the problem in Eq. (1) needs a non-polynomial time to solve (cf. 
[Natarajan95]) in general, it can be much more effectively solved by using 
many other methods, e.g., i 1 minimization approach, reweighted i 1 method, 
OGA(orthogonal greedy algorithm), and the £ q approach. Let us review these 
approaches in the following subsections and following sections. 
The l\ minimization problem is the following 

minfllxHx, x£l" 1 fe = y}, (7) 

where ||x||i = Y^h=i \ x i\ f° r x = ( x ij x 2,-'' , x n) T - The solution Ai$x is 
called the l\ solution of y = $x. Since the t\ minimization problem is equiv- 
alent to the linear programming, this converts the problem into a tractable 
computational problem. (See [Lai and Wenston'04] for a justification of the 
equivalence and a computational algorithm for £i minimization.) A matlab 
£i minimization program is available on-line. 

But one has to study when the (PI) solution (the solution of Eq. (7)) 
is also the (PO) solution (the solution of Eq. (1)). There are two concepts: 
mutual coherence(MC) and restricted isometric property (RIP) of the matrix 
$ to help describe the situation. 

3.1 The Mutual Coherence 

Let us begin with the spark of matrix A, the smallest possible number a such 
that there exists a columns from A that are linearly dependent. It is clear 
that cr(A) < rank(A) + 1. The following theorem is belong to [Donoho and 
Elad'03]. 

Theorem 3.1 A representation y = $x is necessarily the sparsest possible 
if ||x||o < spark(§)/2. 
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Proof. Suppose that there are two sparse solutions x 1 and x 2 with 
|| ^c 1 1| o < k and ||x 2 ||o < k solving y = $x. Then $(x x — x 2 ) = 0. So 
|| (x 1 - x 2 )|| < 2k but, || (x 1 - x 2 )|| > spark($). It follows that k > 
spark($)/2. Hence, when k < spark(<&)/2, the sparsest solution is unique. 
That is, if one find a solution x of $x = y with ||x|| < spark($)/2, then x 
is the sparse solution. ■ 

Next we introduce the concept of mutual coherence of matrix $. Assume 
that each column of $ is normalized. That is, $ is a dictionary. Let G = $ T< 1 ) 
which is a square matrix of size n x n. Write G = (9ij)i<i,j<n, the mutual 
coherence of $ is 

M = M($) = max \g i:j \. 

l<i,j<n 

Clearly, M < 1. We would like to have matrix $ such that its mutual 
coherence M is as small as possible. 

However, M (<£>) can not be too small. We have 



Lemma 3.2 If n > 2m, then M($) > (2m 



1-1/2 



Proof. Indeed, let Xi,i = 1, • • • ,n be eigenvalues of G. Since G is 
positive semi-definite, all Aj > 0. Since the rank of G is equal to m, only m 
nonzero Aj. Since Y2i \ is equal to the trace of G which is n since ga = 1. 
That is, 



n 



5><v^ 4 /£a?. ( 8 ) 



On the other hand, using a property of the Frobenius norm of G, we have 

(9) 





V x 2 — urn 2 - 


E 


(^) 2 - 






i 


l<i,j<i 


t 




It follows from Eq. (8" 


) and (9) that 








(n 2 - 


-n)M(®) 2 + n> 


E 

l<i,j'<r 


(^) 2 > 


n 2 
m 



That is, M(<&) > J ^in-i) - ^ n particular, when n > 2m, we have M($) > 

(2m)- 1 / 2 . That is, M($) e ((2m)- 1 / 2 ,!]. ■ 

With M(<&), we can prove the following (cf. [Donoho and Elad03]) 
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Theorem 3.3 Let Spark(Q) be the spark of $ and M($) be the coherence of 
$. Then 

Spark(<$>) > 1/M($). 

Next we need the following lemma. 

Lemma 3.4 Let k < 1/M + 1. For any S C {1, • • • ,n} with #(£) < k and 
$5 be the matrix consisting of the k columns of $ with column indices in S. 
Then the kth singular value of &$ is bounded below by (1 — M(k — l)) 1 ^ 2 and 
above by (1 + M(k- l)) 1 ' 2 . 

Proof. For any vector v G R™ with support on S, we have 
V T GV = V5$|$ 5 V 5 = ||v|| 2 + ^2 v i9ij v j- 






Since 



ijes ijes 



J2 v i9ij v j\\ < M J2 



< M ! > \viVj\ - \\v\\l 

< M||vj| 2 (A;-l), 



we have 



v 5 $f$ 5 v 5 > ||v|| 2 - M(k - l)||v|| 2 = (1 - M(k - l))||v|| 2 . 
Similarly, 

v T Gv = v 5 $f $ 5 v 5 = ||v|| 2 + ^ v i9ijVj 

< ||v|| 2 + M(k - l)||v|| 2 = (1 + M(k - l))||v|| 2 . 

These complete the proof. ■ 

We first show that if k < (1 + 1/M)/ 2 and for any y G TZk, the sparse 
solution of Eq. (1) is unique. 

Lemma 3.5 Suppose k < (l + l/M)/2. For any y G TZk, the sparse solution 
of Eq. (1) is unique. 
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Proof. Let y = $Xo G TZk- Let xi be a solution of Eq. (1) with 
|| x i||o < k. Then ||xo — xi||o < 2k. By Lemma 3.4, we have 

(1 - M(2k - l))||x - Xl ||l < ||$(x - Xl )||l = 0. 

Since 1 - M(2k - 1) ^ 0, it follows that x = x x . ■ 

In order to fully understand the computation of Eq. (7), we next intro- 
duce a variance of Eq. (7): solve the following minimization problem 

mindlxHi, xGK r \||$x-y|| 2 <5}, (10) 

where y = $x + -2 with ||^|| 2 < e < 5 and x G M n is a vector with k nonzero 
entries, that is, $x G TZk- Hence, we consider the case that y has some 
measurement error and compute a solution x within accuracy 5. 

Theorem 3.6 Let M be the mutual coherence of $. Suppose that 

k< (l/M+l)/4. 

For any xq with ||xo||o < k, let x e ^ be the solution of Eq. (10). Then 



X e A-X o < 



2/ (e + ^) 2 



1 - M(Ak - I] 



Proof. Write w = x e> 5 — x . Clearly, ||x £j< 5||i = ||tu + x ||i < ||x ||i when 
computing the £± minimization. Let S G {1, 2, • • • , n} be the index set where 
x is supported. Since ||tu + x ||i > ||x ||i — J2i^s \ Wi \ + 2~2i^s \ w *\i we nave 
E is sN <Et 6 srN or 

HHIi < 2^H < 2v / )b||iu||2, (11) 

ies 

where S denotes the complement set of S in {1, 2, • • • , n}. 

On the other hand, ||$x e ^ — y II 2 < $ an d y = ^Xq + z imply that 
\\$w + z\\ 2 < 5. That is, \\$w\\ 2 < \\&w + z\\ 2 + e < 5 + e. 

Finally, \\$w\\l = IMI2 + wT ( G ~ !) w > IMI2 ~ M (IMIi ~ IMI2) > 
(1 + M)||ty||| — M4/u||iy||| by the estimate (11) above. It follows that 

, „9 1 ,,« 1,9 (e + ^) 2 

H <- — 7777 $H 2 < 



1 + M-4M&" uz ~ 1-M(4fc-1) 
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by the estimate in the previous paragraph. This concludes the result in this 
theorem. ■ 

Next we look at the (e, 5) variance of the (PO) problem: to solve the 
following minimization problem 

min{||x|| , xeR n ,||$x-y|| 2 <<$}, (12) 

where y = $x + z with ||z|| 2 < e and $x G 1Z)~. Then we can prove 

Theorem 3.7 Let M be the mutual coherence o/$. Suppose that 

k< (l/M+l)/2. 

For any x with ||x ||o < k, let x ei( 5 be the solution of Eq. (12). Then 



r 1,2/ ( e + 5 ) 

|x ei< 5-Xo|| 2 < 



2 



1 - M(2k - 1) 
Proof. By Lemma 3.4, we have 

||x 6 x — Xnlln < ; r||$(x f x — Xn)||n 

II e,6 0112 - ! _ _kf(2fc - 1) " V ' ; " 2 



1 ll*~ • 112^ ( e + ^ 



1-M(2A;-1)" ' ' " z ~ 1 - M(2k - 1) 

This completes the proof. ■ 

Both theorems above were proved in [Donoho, Elad and Temlyakov'06]. 
In particular, the proof of Theorem 3.7 above is a much simplified version of 
the one in [Donoho, Elad and Temlyakov'06]. It is easy to see that there is a 
gap between the requirements of k. That is, one is to require k < (l + l/M)/4 
by any l\ method and the other is to require k < (1 + l/M)/2 by an £ 
method. Thus, the l\ method is not optimal yet. It is interesting to know 
how we can increase k when using the £± method. 

3.2 RIP 



Another approach is to use the so-called Restricted Isometry Property(RIP) 
of the matrix $. Letting < k < m be an integer and At be a submatrix of A 
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which consists of columns of A whose column indices are in T C {1, 2, • • • , n}, 
the k restricted isometry constant 5 k of A is the smallest quantity such that 

(l-5 k )\\x\\i<\\A T x\\l<(l + 6 k )\\x\\t (13) 

for all subset T with #(T) < k. If a matrix A has such a constant 8 k > for 
some k, A possesses RIP. With this concept, it is easy to see that if 5 2k < 1, 
then the solution of Eq. (1) is unique. Indeed, if there were two solutions x 1 
and x 2 such that 

$(x x - x 2 ) = 0, 

then we choose the index set T which contains the indices of the nonzero 
entries of x 1 — x 2 and see that #(T) < 2k which implies 

(l-5 2fc )||x 1 -x 2 || 2 <||d> T (x 2 -x 2 )|| 2 = 0. 

It follows that Hx 1 — x 2 || 2 = when 8 2k < 1- That is, the solution is unique. 
Furthermore, 

Theorem 3.8 ([Candes, Romberg, and Tao'06]) Suppose that k > 1 such 
that 

hk + 35 4fe < 2 

and let x G R n be a vector with ||x|| < k. Then for y = $x 7 the solution of 
Eq. (7) is unique and equal to x. 

This result is recently simplified slightly in the following way: 

Theorem 3.9 ([Candes'08]) Suppose that k > 1 such that 

5 2k <V2-l. 

Let x G R™ be a vector with ||x|| < k. Then for y = Ax, the solution of Eq. 
(7) is unique and equal to x. In fact 

ll *ll ^ ?(l + p) , *,, 2 

x — X \\o < \\Ax — Ax o H x — x T i. 

1-p 1-p 

where x* zs #ie fPi^ solution (the solution of Eq. (7) and x^ is the vector of 
the k largest components o/x*. i/ere p = ■#- . 
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As we have already known that 82k < v2 — 1 < 1 which implies that the 
sparse solution is unique. The above result mainly explains that the (PI) 
solution (the solution of Eq. (7)) is equal to the (PO) solution (the solution 
ofEq. (1)). 

The results are consequences of the following Theorem 5.2 and hence we 
omit the proofs of the above two theorems here. 

Let us discuss what kind of matrices $ satisfies the RIP. So far there is no 
explicit construction of matrices of any size which possess the RIP. Instead, 
there are a couple of constructions based on random matrices which satisfy 
the RIP with overwhelming probability. In [Candes, Romberg, and Tao'06], 
the following results were proved using the measure concentration technique 
(cf. [Ledoux'01]). 

Theorem 3.10 Suppose that A = [ciij}i<i<m,i<j<n be a matrix with entries 
aij being iid Gaussian random variables with mean zero and variance \j \fm. 
Then the probability 



^(|||&c||l-||x||l|<c||x|||)>l-^)(l + 2 / 



/e) k e~ me2/c . (14) 

for any vector x G R n with ||x||o = k, where c > 2 is a constant and ||x||o 
denotes the number of nonzero entries of vector x. 

Once we choose k < m such that (T)0- + 2/e) fc e~ me / c < 1 small enough, 
we will have a good probability to have a matrix satisfying the RIP. Indeed, 
since (™) < {n/e) k , 

( n \ M _i_ 2/e) k e~ me2 / C < g- me2 / c + feln (n/e)+fcln(l+2/e) 

As long asm> fccm(n(l + 2/e)/e)/e 2 , we have 

P(|||$x|| 2 - ||x|| 2 | < e||x|||) > 0. 

That is, a matrix with RIP can be found with positive probability. 

Theorem 3.10 can be simply proved based on the following theorem (cf. 
[Baranuik, Daveport, DeVore, Wakin'08]). 
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Theorem 3.11 Suppose that A = [aij\i<i< m ,i<j< n be a matrix with entries 
dij being iid Gaussian random variables with mean zero and variance 1/ ' \/m. 
Then for any e > 0, the probability 

2 

P(\\\Ax\\l ~ N2I < e||x||i) > 1 - 2exp( — ), (15) 

where c is a positive constant independent of e and ||x||2 for any x G R n . 

In general, there are many other random matrices satisfying the above 
probability estimate. Typically, matrices with sub-Gaussian random vari- 
ables possess the RIP. See [Mendelson, Pajor, and Tomczak-Jaegermann'07, 
'08]. In addition to the measure concentration approach, there are several 
other ideas to prove the results in the above theorem. For example, [Pisier'86] 
and [Lai'08]. We refer to [Lai'08] for an elementary proof of Theorems 3.11 
and 3.10 and similar theorems for sub-Gaussian random matrices. For con- 
venience, we borrow the proof of Theorem 3.11 from [Lai'08] and present it 
in the Appendix for interested reader. 

3.3 The re-weighted l x Method 

The re-weighted l\ minimization is the following iterations: 

(1) for k — 0, solve the standard t\ problem: 

mia{||x||i, xeK r \Ax = y}, (16) 

(2) for k > 0, find x^ which solves the following weighted l\ problem: 



n 

\X 

mm 

w 

i=l 



inV^, xer,Ax = y}, (17) 

^— t lit- 



with W{ = \x\ ~ I + e for k — 1, 2, 3, • • • , n. 

This method is introduced in [Candes, Watkin, and Boyd'07]. The re- 
searchers gave some heuristic reasons that the algorithm above converges 
much faster than the standard l\ method. It is still interesting why the 
method works better in theory. 
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4 The OGA Approach 

There are many versions of the Optimal Greedy Algorithm(OGA) avail- 
able in the literature. See [Temlyakov'OO], [Temlyakov'03], [Tropp'04], and 
[Petukhov'06]. We mainly explain the optimal greedy algorithm (OGA) pro- 
posed by A. Petukhov in 2006 when $ is obtained from a tight wavelet frame. 
That is, $ is a matrix whose columns are frame components &, % — 1, • • • , n 
satisfying $$ T = I m , where I m is the identity matrix of size m x m. It has 
two distinct advantages: (1) Iterative steps for the least squares solution and 
(2) more than one terms are chosen in each iteration. 

Let A be an index set which is a subset of {1, 2, • • • , n} and A be the 
complement of A in {1, 2, • • • , n}. Also let Pa be the diagonal matrix of size 
n x n with entries to be 1 if the index is in A and otherwise. 

Suppose that we have a fixed index set A. We first introduce a com- 
putationally efficient algorithm for finding coefficients of the linear combi- 
nation f\ = ^2 ie \CLi4>i which is the least squares approximation of /, i.e., 
11/ — /a||2 = mm {||/ — g\\2,g G 5a} where / G R m is a given vector in R m 
and 5a is the span of <f>i,i G A. In general, f\ can be computed directly by 
inverting a Gram matrix [((pi, <Pj)]i,jeK- When m is large, it is more efficient 
to use the following algorithm to find an approximation of f\. 

Algorithm LSA (least squares approximation): Set k = 0,g° = 
f, f° = 0. For k > 1, let g k = g k - l -^P K ^ T g k ~ l and f k = f k - 1 +^P A ^ T g k ~ 1 . 
Stop the iterations when g k — g k ~ x is very small. 

We have the following 
Theorem 4.1 The sequence f k converges to f\ in the following sense: 

||/ fc -/A||2<(l-7 2 ) fe/2 HM|2, 

where 7 is the least non-zero singular value of the matrix $a- 

Proof. We rewrite g k as g k = g\+g k , where g\ is the best approximation 
of g k using the span of columns from $a- Clearly, g° K = f\. For k > 1, 
g\ = g A _1 — ^Pa^^a" 1 since the best approximation operator in R m is a 
linear operator. Similar for f k = f^' 1 + QP^g 1 ^ 1 . Note that f k = f k for 
all k > 0. We have 

f +9a = fl + 9 k A = it l +9 k K l = --- = fi + 9l = fA- 
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It follows that 

\\f* ~ f4* = \\ti\U = \\V ~ *P** T )st%- 

Note that I - $P A $ T = $(/ - P A )$ T and hence 

\\I - $P A $ T || 2 < ||$(7 - P A )$ T || 2 < (1 - 7 2 ) 1/2 - 
Therefore, 

wf-fAh = \\gih <(i-7 2 ) i/2 ibrii 2 

< •••<(i-7 2 ) fc/2 Kii 2 = (i-7 2 ) fe/2 n/Aii 2 . 

This completes the proof. ■ 

We are now ready to present the Petukhov version of orthogonal greedy 
algorithm (OGA). 

Algorithm OGA: Set A = 0, g° = f, f° = 0. Choose a threshold r e (0, 1] 
and a precision e > 0; 

Step 1. For k > 1, find M fc = max^ Afcl |(^ fc_1 , 0i/||0i||)|; and Let A fc = 
A k -iU {i,\(g k -\<f>i/Ui\\)>rM k }; 

Step 2. Apply Algorithm LSA above over A& to approximate g k ~ x to find 
f Ak and g Ak . Update / fc = f^ 1 + f Ak and 5f fe = g^ 1 - f Ak . 

Step 3. If ||/ — / fc || 2 < e, we stop the algorithm. Otherwise we advance 
k to k + 1 and go to Step 1. 

There is lack of theory to justify why the above OGA is convergent in the 
original paper [Petukhov'06] and in the literature so far. We now present an 
analysis of the convergence of the above OGA. 

Theorem 4.2 Suppose that $ of size n x N has the RIP for order k with 
1 < k < n. Then the above OGA converges. 

Proof. Without loss of generality we may assume that A m = {1, 2, • • • , n m } 
for some n m < n, where m — 1, 2, • • • . Let 

G m = [((f>iAj)] 1 < id < nm 
be the Grammian matrix. Define 
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to be the smallest and largest eigenvalues of symmetric G m . The RIP of $ 
for integer n m implies that a m > for m — 1, 2, • • • , m with n mo = n. 
We first observe that the best approximation f\ m = (D" 1 [{f,<f>i)]i <i<n ■ 
Then due to the result in Theorem 4.1, let us for simplicity, assume 
that f\ m is the best approximation of R m -i{f)- We next note that for % e 

|(i? m _ 1 (/),0 l /||</. l ||)|>rM m 



with 



M m = max | (Rm-i(f), 4>i/\\<t>i\\) I = max |(i? m _i(/), &/||&||)| 

iW m _i 1=1, -,n 



> l^a^i^.^/), 



i=i 



for any «j such that Xir=i l a *l — 1- Assume that / = YH=i c i$i with 
SILi l c *l — 1 (with appropriate normalization). It follows that 

M^KiC-iOO^IHI^-iOOII 2 - 

Hence we have 

\\R m (f)\\ 2 = (Rm-i(f) - fA m ,Rm-i(f) - /O = \\R m -i(f)\\ 2 - ||/aJ| 2 

and 

II/aJI 2 = \\Kl[(R m -i(f)^' T 



h 'r^\i=l,■■■ ,n 

1 .... 
> - 

1 



y\\[(R m -l(f),^tl,-,n m \\ 2 



> ^XJ^-i(/)|| 2 . 



That is, 



\R m {f)f = ||iW/)|| 2 - II/aJI 2 < ||i? m -i(/)|| 2 - -^rXJI^-i(/)U 2 - 



Summing the above inequality over m — 1, • • • , k, we get 

l|/W)H 2 < l|i?o(/)|| 2 -r 2 £ ^-^l^^a)!! 2 < ||/|| 2 -r 2 J2 ^-n 2 m \\R k (f)\\ 2 . 



m=l m=l 
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because of the monotonicity of ||i? m (/)||. In other words, 

k 

dr. 



k 

(^ 2 E— <+i)ii^(/)ii 2 < 

< • n 



m=l 



As Ylm,=i n m diverges and a m nonincreases, ||-Rfc(/)|| has to converge to zero. 
This completes a proof of the convergence of this OGA. ■ 

The OGA can be used to solve our research problem Eq. (1). For y e TZk, 
the OGA algorithm uses the indices which are associated with the terms 
| (y, fa)\,i G {1, 2, • • • ,n} which is > r% of the largest value. As the size of 
Aj increases, it finds an approximation xoga,<e such that Qxoga,€ is closed to 
y within the given e. That is, ||$xoga,<e — y|| < €. 

We now explain why ~X-oGA,e is a good approximation of x. Due to the 
construction, the number of nonzero entries ||xoGA,e||o — k* << n - Similar 
to the RIP, let «&, (3\. > be the best constants in the inequalities 

ctfclNh < ||$z|| 2 < /?fe||z|| 2 , for all ||z|| < k. 
Then since ||x — x OGj 4 i(E || < k + k* , 

||x - X GA, e || 2«fc+fc* < ||$(x - XOGA,e)h = \\y ~ $X GA,j2 < €. 

That is, we have 

II II ^ e 

|| X — X OGA,e||2 S • 

ttfc+fc* 

In particular, when k* < k, all we need is to assume that a<ik > and 
hence xoga,6 is away from x by efoiik- 

Next we need to show that k* < k may happen. Assume that each 
column of $ is normalized. For y = $x with x = (xi, £2, • • • , x n ) T , without 
loss of generality, we may assume that the support of x is S — {1, 2, • • • , k}, 
\xk\ = min{|xj| ^ 0, i — 1, • • • , n}, and |xi| = Hx^. 

Suppose that 

^4 < 18) 

where M = M(<&) stands for the mutual coherence of $. Then we can claim 
that the support (xoGA,e) C support(x). Recall $ = [fa,--- ,(f> n ] with fa 
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being the ith column of $. Let us first compute the inner products of y with 

0i's. 

k 

\(y,<f>i)\ = K$x,&)l = l^ixjfafa)]. 

and 

k k 

\^2( x j)<PjAi)\ > \(xi<t>i,<i>i}\ -^2\(xj<i>jAi}\- 

j=l 3=2 

In particular, we have 

|(y,0i)|>ki|-M(A;-f)|x 2 |. 
and 

k 

\{y,<f>i)\< |^(^0„0,)|<|a;i|A;M. 

By our assumption in Eq. (18), we have 

|xi| - M(k - l)\x 2 \ > \xi\ - M(k - l)|xi| > \xi\kM. 
it follows that 

|(y>i)|>|(y,^)|, V*>2. 

That is, the largest inner product is |(y,0i)|. 
Furthermore, let us assume 

1 f\Xb\ \ 

k < t r— f-T + M ) ot rk\xAM < \x k \ - M(k - l)|a;i I, (19) 

(1 + r)M \ \xi\ J 

where r is the positive constant r < 1 employed in the OGA. 

Then for 2 < j < k, \{y,<j)j)\ > \x j\ - M(k -l)\xi\ > \x k \ - M(k - l)|xi| 
and r|(y,0i)| < rk\x\\M. It follows that \{y,(pj)\ > r\{y, <f>i)\- That is, the 
first greedy step in the above OGA picks up all the indices of the nonzero 
entries of x. 

In particular, when the nonzero entries of x are 1 in absolute value, the 
condition in Eq. (19) is simplified to 

k< - 3_ — (1 + M). 

- 1 + r M v ; 
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That is, if k satisfies Eq. (18), then k satisfies Eq. (19). Under the condition 
in (18) or the conditions in (18) and (19), the OGA picks all the entries 
0i, • • • , 0fc. Hence, the support (^oGA,e) is the same as the support of x. 

Furthermore, ||x OGj 4, e - x||i < v^||x* - x|| 2 . Since \\$(x GA,e - x)|| 2 = 
||*x OGj4 , e - y|| 2 < e, we have 

e 2 > ||$(x GAe-x)||2 

= \\*OGA,e - xf + (XOGA,e ~ X) T (G - I)(XOGA,e ~ x) 
> \\*OGA,e - 3CJ|| - M(\\x GA,€ - X||i - \\XOGA,e ~ X|||) 

= (1 + M)||x OGA)£ - x||| - M/c||x OGAe - x|||. 



That is, 



2 



| X G A,e _ X ll2 ^ 



e 



2 



12 ~ l-M(Jfe-l)' 

That is, under the assumption that the sparsity of x is small, i.e., Eq. (18), 
xoGA,e approximates the sparse solution x very well. 

4.1 L\ Greedy Algorithm 

Recently, Kozlov and Petukhov proposed a new greedy algorithm (cf. [Kozlov 
and Petukhov'08]). It is called L\ Greedy Algorithm. The algorithm starts 
with the solution of the l\ minimization under the constraint Azo = y. 

[1] Let z be the solution of the l\ minimization under the constraint Az = y 
among z G R". 



[2] Let M = \\z 



on 



[3] For % — 1, • • • , N, let W e R™ be a weighted vector with 1 in the all 
entries except for those entries which are 1/10000 when |z* _1 | > 0.8M, 
1 < j < n. 

[4] Solve the weighted l\ minimization problem 

n 

min{^ \zi\/wi, Az = y, z € R n } 

3=1 

and let z % be the solution. 
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[5] If z* is not yet a sparse solution, let M = 0.8M and return to Step 3. 

The algorithm works well for random matrix A of size 512 x 1024. It is in- 
teresting to give an analysis of the convergence or reasons why the algorithm 
works. 



5 The £ q Approach 

Let 






\q 



be the standard £ q quasi- norm for < q < 1. It is easy to see that lim ||x||^ = 

||x||o. We can use ||x||^ to approximate ||x||o. Thus, we consider the following 
minimization 

min{||x||«, xGl n ,k = y}. (20) 

for < q < 1 as an approximation of the original research problem Eq. (1). 
A solution of the above minimization is denoted by A g $x. 

5.1 Recent Results on the £ q Approach 

The several £ q methods were studied recently in [Chartrand'07], [Foucart and 
Lai'08], [Davies and Gribonval'08] and [R. Saab and O. Yilmaz'08]. The first 
piece of results is shown in [Chartrand'07] 

Theorem 5.1 Let q e (0, 1]. Suppose that there exists a k > 1 such that the 
matrix $ has RIP constant such that 

5 ks + k 2 ^-\ k+1)s < k 2 '*- 1 - 1. 

Then the solution of Eq. (20) is the sparest solution. 

One can see that this result is a generalization of Theorem 3.10. When 
q — 1 and k — 3, the above condition is the condition in Theorem 3.10. 
In fact, the proof is a generalization of the proof in [Candes, Romberg, and 
Tao'06] for £i norm to £ q quasi-norm. In [Foucart and Lai'08], we felt that 
the non- homogeneity of the Restricted Isometry Property (13) contradicted 
the consistency of the problem with respect to measurement amplification, 
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or in other words, that it was in conflict with the equivalence of all the linear 
systems (cA)z = cy, c G R. Instead, we introduce ak,/3k > to be the best 
constants in the inequalities 

afclMh < ll^zlh < /3 fc 1 1 z 1 1 2 , ||z|| < k. 

Our results are to be stated in terms of a quantity invariant under the change 
A <— cA, namely 

72s := o - L 

Oils 1 

In fact, a\ = \ — 5k and /3| = 1 + Sk- We use this slightly modified version of 
RIP and work through the arguments of [Candes, Romberg, Tao'06] in terms 
of quasi-norm £ q to get the following theorem. 

Our main result in this section is the following (see [Foucart and Lai'08] 
for a proof) 

Theorem 5.2 Given < q < 1, if 

72* — 1 < 4(v2 — 1)1-1 for some integer t > s, (21) 

then every s-sparse vector is exactly recovered by solving Eq. (20). 

Corollary 5.3 Under the assumption that 

7 2s < 4V2 - 3 w 2.6569, (22) 

every s-sparse vector is exactly recovered by solving (7). 

When q = 1, this result slightly improves Candes' condition in Theorem 
3.9, since the constant 7 2s is expressed in terms of the Restricted Isometry 
Constant 62s as 

1 + s 2s 

72s = Z 7—, 

1 -^2s 

hence the condition (22) becomes 82s < 2(3 — v2)/7 ~ 0.4531. 

The second special instance we are pointing out corresponds to the choice 
t — s + 1. In this case, Condition (21) reads 

, lX 1/9-1/2 
72s+2 < l + 4(v / 2-l) (l + - 

The right-hand side of this inequality tends to infinity as q approaches zero. 
The following result is then straightforward. 
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Corollary 5.4 Under the assumption that 

l2s+2 < +00, 

every s-sparse vector is exactly recovered by solving (20) for some q > small 
enough. 

The key point is to show for any v which is in the null space of $, i.e, 
$v = 0, ||vg|| g < llvgllg unless v = 0, where S stands for the index set of the 
nonzero entries of the solution x e TZk , vg denotes the vector v restricted in 
S with other entries being zero and S is the complement indices of S. 

This is indeed the case since for v = x — x* in the null space of <5, 
|| v sl|g < ll v s||g) where x* is the solution of Eq. (20) and x is the sparse 
vector supported on S satisfying $x = y. Combining the above inequality, 
we have a contradiction that ||vg|| g < ||vg|| g unless v = 0. Thus, the solution 
of the minimization is the exact solution if we can show ||vg|| g < ||vg|| ? . This 
inequality was recognized in [Grinoval and Nielson'03]. The condition (21) 
in Theorem 5.2 implies this inequality. 

5.2 More about the £ q Approach 

We first show that the minimization problem Eq. (20) has a solution for 
q > 0. That is, the existence of the solution is independent of the RIP of <5. 
See [Foucart and Lai'08] for a proof. 

Theorem 5.5 Fix < q < 1. There exists a solution A g Ax solving Eq. 
(20). 

We next consider the situation that the measurements y are imperfect. 
That is, y = $Xo + e with unknown perturbation e which is bounded by a 
known amount ||e|| 2 < 9. In this case we consider the following 

min{||x||«, xeR n ,||$x-y|| 2 <0}. (23) 

A solution of the above minimization is denoted by A 9i e$x. As in the previ- 
ous section, we have 

Theorem 5.6 Fix < q < 1 and 9 > 0. There exists a solution A^c&x 
solving Eq. (23). 
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In [Saab and Yilmaz'08], they extended the proof in [Candes'08] in the 
£ q setting. They have 

Theorem 5.7 Let q G (0, 1]. Suppose that 5 ks + k 2lq ' l 5 {k+1)s < k 2 / q ~ l - 1 
for some k > 1 with kS G Z + . Let x* be the solution of Eq. (23). Then 

C 

II Y Y*ll g < CrP I A (y'W 

||X X || 2 s Oi?7 + ji^^Wj 

for two positive constants C\ and C 2 - 

Here, the quantity A*.(x) g denotes the error of best fc-term approximation 
to x with respect to the £ g -quasinorm, that is 

A fe (x) g := inf ||x-z|| g . 

||z||o<fe 

The above theorem is an extension of Chartrand's result (cf. Theorem 5.1). 
Next we state another main theoretical result of this survey. We refer to 
[Foucart and Lai'08] for a proof. 



Theorem 5.8 Given < q < 1, if Condition (21) holds, i.e. if 

/ t \i/H/2 
lit ~ 1 < 4(v2 — 1)1-1 for some integer t > s, (24) 

then a solution x* of (P q ,e) approximate the original vector x with errors 

||x-x*|| g < d-a^), +D 1 -s 1/q ~ 1/2 -e, (25) 

l|x-x*|| 2 < C 2 -0f- 2+ D 2 .e. (26) 

The constants C\, C 2 , D\, and D 2 depend only on q, ^t, and the ratio s/t. 

Comparison of the results in Theorems 5.8 and 5.7 is given in [Saab and 
Yilmaz'08]. It concludes that when k is around 2, the sufficient condition 
(24) is weaker while the condition in Theorem 5.7 is weaker when k > 2. 
Numerical experiemental results in [Foucart and Lai'08] show that the £ q 
method is able to 100% recovery all the sparse vectors with sparsity is about 
s = m/2. That is, in order to have ks < m, k is about 2. 

Next we consider that negative results discussed in [Davies and Gribno- 
val'08]. That is, when the £ q method may fail. 
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Theorem 5.9 For any e > 0, there exists an integer s and dictionary $ with 
a restricted isometry constant 82s < l/v2 + e for which l\ method fails on 
some k sparse vector. 

Now the gap between the positive result 82s = 2(3 — v / 2)/7 = 0.4531 and 
the negative result 82s = I/a/2 + e = 0.7071 is about 0.2540. In general, 
Davies and Gribnoval consider a special matrix $ which has a unit spectral 
norm, i.e., 

,, ,, ll^yL 

11$ || 2 = SU P -fl n — 1- 

y^o llylh 

Then they define 

„>(*) := mm » 

!/e y 2 

||y|| <fc IIJ l|Z 

which is equal to ct\ in [Foucart and Lai'08]. 

Theorem 5.10 Fix < q < 1 and let < i] q < 1 be the unique positive 
solution to 

r] ^ + l = -(l-r ]p ). 

For any e > 0, there exist integers s > 1, N > 2s + 1 and a minimally 
redundant unit spectral norm tight frame §n-ixn with 

°i s m > 1 - -%« - c 

for which there exists an s-sparse vector which cannot be uniquely recovered 
by the £ q method. 

5.3 Numerical Computation of the £ q Approach 

The minimization problem (P q ) suggested to recover x is nonconvex, Follow- 
ing [Foucart and Lai'08], we introduce an algorithm to compute a minimizer 
of the approximated problem, for which we give an informal but detailed 
justification. 

We shall proceed iteratively, starting from a vector z satisfying Az = y, 
which is a reasonable guess for x, and constructing a sequence (z n ) recursively 
by defining z ra+ i as a solution of the minimization problem 

N 1 1 

minimize \_. 71 r~T — vT77 subject to Az = y. (27) 



zm N 



-j (\ z .1 j_ e u- 
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Here, the sequence (e n ) is a nonincreasing sequence of positive numbers. It 
might be prescribed from the start or defined during the iterative process. 
In practice, we will take lin^^oo e n = 0. We shall now concentrate on con- 
vergence issues. We start with the following 

Proposition 5.11 For any nonincreasing sequence (e„) of positive numbers 
and for any initial vector z satisfying Az = y, the sequence (z n ) defined by 
(27) admits a convergent subsequence. 

Similar to the proof of Theorem 5.5, we can see that the solution of the 
above minimization exists. We further show that the solution x e of Eq.(27) 
will converge to the solution Eq. (20). For convenience, let x be a solution 
ofEq. (20). 

Theorem 5.12 Fix < q < 1. Let x e be the solution of Eq. (27). Then x e 
converges to Si as e — ► + . 

The new minimization problem Eq. (27) can be solved using l\ method 
since .F 9)£ (x) is a weighted l\ norm. 

Proposition 5.13 Given < q < 1 and the original s-sparse vectors, there 
exists i] > such that, if 

e n < rj and \\z n — x||oo < r\ for some n, (28) 

then the algorithm 27 produces the exact solution. That is, 

Zfc = x for all k > n. 

The constant rj depends only on q, x ; and 72 S . 

Lemma 5.14 Given < q < 1 and an s-sparse vector x ; if Condition (21) 
holds, i.e. if 

/^n 1/9-1/2 

72* — 1 < 4(v2 — 1)1-1 for some integer t > s, 

then for any vector z satisfying Az = y, one has 

||Z X|| g S ^ [\\ z \\q ll x ll g J> 

for some constant C depending only on q, r )2t, and the ratio s/t. 
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Figure 1: Comparison of 



1) ^qi 



and OGA methods for sparest solutions 



Finally we present the following 

Proposition 5.15 Given < q < 1 and the original s-sparse vector x, if 
Condition (21) holds, i.e. if 



72< - 1 < 4(V2 - 1) ( - 



l/q-l/2 



for some integer t > s, 



then there exists ( > such that, for any nonnegative e less than (, the vector 
x is exactly recovered by solving (27). The constant ( depends only on N, q, 
x, j2t, and the ratio s/t. 

Numerical results show that our £ q approximation method works well. In 
Figure 1, we present the frequencies of the exact recovery using various meth- 
ods for Gaussian random matrix of size 128 x 512 for various sparse vectors. 
For each sparsity, we randomly generate the Gaussian random matrix $ and 
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a vector x with the given sparsity and tested various methods to solve the x 
for 100 times. The number of exact recovery by each method is divided by 
100 to obtain the frequency for the method. 
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6 Appendix 1: Gaussian Random Matrices 

Let A = [dij]i<i<m,i<j<n be a rectangular matrix with a^- being iid Gaussian 
random variables with mean zero and variance a 2 . Let x = (x±, • • • , x n ) T G 
R n be a vector. We use ||x|| 2 denotes the norm of x. Consider a random 
variable X = (X 1; • • • , X m ) T with JQ = (X^?=i a ij x j) 2 , i = 1,- • ' ,m. Since 
E(ay) = 0, we have EpQ) = cr 2 ||xj|| for all i. Let & = JQ — EpQ) be a new 
random variable and let 



m — / j si 



i=l 

be the sum of these new independent random variables. It is easy to see that 

S m = \\Ax\\l -ma 2 ||x|| 2 . 
In this section, we are interested in proving the following inequality. 
Theorem 6.1 For any e > 0, the probability 

2 

V(\\\Ax\\ 2 2 - ma 2 ||x||^| < e\\x\\ 2 2 ) > 1 - 2exp(- ? ' ™ ^), (29) 



where c is a positive constant independent of e and ||x| 



2- 



We plan to use the Bernstein inequality (cf. [Buldygin andKozachenko'OO, 
p. 27]) to prove this result. For convenience, we state the inequality below. 

Theorem 6.2 Suppose that £«, 1 < % < m are independent random variables 
with E(&) = and E(^ 2 ) = u 2 < oo,l < i < m. Let S m = YT=i&- 
Moreover, suppose that there exists a constant H > such that 

771 ' 

|E(tf)| < Y u * Rk ~ 2 ^ 

30 
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for all integer k > 1 and all % — 1, • • • ,m. Then the following inequality 
holds for all t > 0: the probability 



V{\S m \ >t)< exp 



t 2 



%tH + E™! v\ 



Proof. (The proof of Theorem 6.1.) We need to study Eq. (30) for 
£i = Xi — E(Xj) for k > 3 since for k = 2, Eq. (30) is satisfied trivially. 

For convenience, let // = E(JQ) = cr 2 ||x:|||. It is easy to see E(|^| 2 ) = 2/x 2 . 
Thus, v\ = 2fi 2 . For k > 3, we have 



E(i6i fc ) = m*i - v) k ) = E (*)e(x/)(-i)*-^ 



Let us spend some effort to compute E(X/). We have 
E(X>) = E(£ OijXi)* = J2 J^Tf^Ki^ • • • <»)# ■■■<■ 

3 = 1 jl+-+jn=2j Jl ' Jn ' 

Note that E(o| ,■) = for all odd integers £ and it is known (using integration 

by parts) that E(of) = , r/ for even integers £. Since a^ are iid 

J 2 t ' 2 {i/2y. 

random variables, we have 

EW) = e /o- J 2j) ; 9 ■. ekw-^)^ 1 -^- 

j\ 2-< (2j 1 )!-..(2j n )! 2^!... Jn ! l lj 

(2j)!o*- " 



2jj! " " 2 2-?j! 



Thus, 



E (i«.i^£(-)§fW-'- 



31 
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(27')! 

By using Stirling's formula, we have — — < 2- ? j!/2 < 2 3 k\/2 and hence, 

h- nk,,k ^-r» 4||^||4 y qfc-2/ 2||„||2\Jfc-2 

— — o U = — 2<7 x L — 6 (a x , 

2 2 2 2 2 

£■1 

<-- 9^- 4 ll v ll 4 w fc - 2 

\ Za X nil 

- 2 II 112 

with H = 13.5cr 2 ||x:|||. That is, Eq. (30) is satisfied for k > 3. By 
Theorem6.2, we have 

P(||Ux|| 2 -mc7 2 ||x|| 2 | >t) <2expi -1. (31) 

v 2 " " 2l ; ~ F \ 2(tl3.5/i + 2m/x 2 ) J V ; 

Choosing £ = e||x|| 2 , we have £13.5/i + 2m/i 2 = <7 2 ||x|||(13.5e + 2ma 2 ) and 
the above probability yields: 

1 € in 1 

P(|||A* - ^llxllH > « M S) < 2ex P {- 2(m<j2)(1356 + w) } . (32) 

In other word, the desirable result of Theorem 6.1 is proved. ■ 

We remark that when a = 1/y/m, the estimate Eq. (29) gives a proof of 
Theorem 3.11. For this special case, we have 

Theorem 6.3 Suppose that £ is a Gaussian random variable with mean zero 
and variance a 2 . Let A be an m x n matrix whose entries are iid copies of 
£. For any e > 0, the probability 

P(|px||2-m(T 2 ||x||2| < erwx 2 ||x|| 2 ) > l-2exp< -— I, (33) 

where c is a positive constant independent of e and ||x|| 2 . 

Proof. (The Proof of Theorem 6.3.) For a random matrix A of size 
m x n with entries a^- being iid Gaussian random variables with zero mean 
and variance a 2 , then we use ema 2 for e in Eq. (32). Then we have 

f cm 1 

P(||Ux|| 2 -mc7 2 ||x|| 2 | > emcr 2 ||x|| 2 ) < 2exp<^ } . (34) 

Mil H2 II H2| II 112; - f^ 2(el3.5 + 2)J v ; 

This completes a proof of Theorem 6.3. ■ 

32 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS, VOL.8, NO.2, 328-335, 2010, COPYRIGHT 2010 EUDOXUS PRESS, LLC 



PATTERNS P 



0cPc(QxQ)n([O,l]x[O,l]) 
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ABSTRACT. We will find, from an approximation point of view, patterns, from uniform patterns to 
fractal patterns. 



1. RATIONAL NUMBERS. 



Given the set 



P 



pe.'Z,qe.'Z,q^Q>, Z the set of integers. We will consider 



m , 



m = 0,1, 2,.. .n >for n = 1,2,... then 



'n[0,l] = UQ„ 



.! = ! 



In one-dimensional finite elements a 10-element model of [0, l] is illustrated in Fig. 1 where the 
connectivity is important to the global model. 



0.2 



0.4 



0.6 



0.S 



Fig. 1 

In two-dimensional finite elements a model of r0,l]x[0,l] is illustrated in Fig. 2, connectivity 
becomes more complicated (triangular elements is more popular) 
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Fig. 2 
The sets Qand QxQ (and so on) appear similar to all levels of magnification. Then we will follow 

CC oc 

a construction, specifically of Qn[0,l] = l)Q K as a projection of the set (|Q„ X Q„ ■ The set 
Qio x Qio ' s as illustrated in Fig. 3 
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Fig. 3 
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And in the following figures, Fig. 4, Fig. 5, Fig. 6, there are cases for different values of p in 
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Fig. 4 
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Fig. 6 



This sets are specifically the sets of points 



a b 



c c 



a = 0,1, ...,c 
b = 0,l,...,c 
c = l,2,...,p 



,for a specified peZ 4 



Patterns due to transformations defined for functions/ and g of the type 
(a;, y) —>(/(.*, y), g(x, y)J over a pattern, carry new patterns as in the Fig. 7, Fig. 8, Fig. 9, Fig. 

p 
10, applying for example (x, y) — > (x 2 , y 2 J over the set {JQ„ xQ„ 
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Working with other sets for example Farey sequences, we can get more patterns like the following 
in Fig. 11 
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2. NEW PATTERNS FROM PATTERNS. 

To construct a periodic function / : M — > R with the condition f(x + 1) = f(x) we need only to 
have g : M — > M with g(x) = f(x) for x e [0,l)and then f(x) = g(x = x(mod(l)) , where 
x = xmod(l) means that for ieZwe havex = x — k e [0,1). In another way the same thing is 
obtained if we write f(x) = g(x— fxh using the integer part function. 



3. REPEATING A LOCATED PATTERN TO FIND NEW PATTERNS. 



Take the family of lines from the point [0,l] to 
defined as 



-,a 



and from the point [l,l] to 



-,a 



det 









X 


y 


1 





l 


1 


1 








a 


1 


2 







= 0, det 







X 


y i 


1 


l l 


1 

2 


a 1 



= 



that is y = 2x(a — 1) + 1 and y = 2x(l- a) + 2a -1 respectively for a = 0,0. 1,0.2,. ...,1 with the 
restriction x e [0,l)as is shown in Fig. 12. 




Fig. 12 



then y-M = 2(jc-M)(a-l) + land y-[y] = 2(x-[x])(l-a) + 2a-l for 
a = 0,0.1,0.2,....,! produce the pattern as shown in Fig. 13 
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Fig. 13 



4. BIBLIOGRAPHY 

Barnsley, Michael F. , and Hawley Rising. Fractals Everywhere. Boston: Academic Press 
Professional, 1993. 

Fugal D. Lee. Conceptual Waveletes in digital signal processing. 2007 

Mandelbrot, Benoft B. The Fractal Geometry of Nature. New York: W. H. Freeman and Co., 1982 

Peitgen, Heinz-Otto, and Dietmar Saupe, eds. The Science of Fractal Images. New York: Springer- 
Verlag, 1988 

The Derive - Newsletter. Derive Users Group. 71/72. Austria 2008 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS, VOL.8, NO.2, 336-343, 2010, COPYRIGHT 2010 EUDOXUS PRESS, LLC 



A note on the construction of the s/(2, R) 

integral for ordinary differential equations of 

maximal symmetry 

Sibusiso Moyo 

Department of Mathematics, Statistics and Physics, 

Steve Biko Campus, Durban University of Technology, 

PO Box 953, Durban 4000, South Africa. 

e-mail:moyos@dut.ac.za 

March 15, 2009 



We discuss some of the integrability properties of ordinary differential equa- 
tions of maximal symmetry. The relationship between the solution symmetries, 
the fundamental integrals and the sl(2, R) integrals is discussed. A linear com- 
bination of the fundamental integrals also leads to an integral which has the 
sl(2,R) subalgebra. The construction of the sl(2,R) integral for order n > 9 
can be tedious and hence requires a more compact generating function. Here 
an illustration of such a construction is given and the integral is constructed 
from the direct integration of the differential equation after multiplying by one 
of the autonomous integrating factors. It is also important to note that symme- 
tries and first integrals form the underlying mathematical basis for the algebraic 
theory of integrable equations. 

Keywords: Symmetry Lie-algebra, maximal symmetry, Integral 

1 Introduction 

We recall that Lie ([l],p 405) showed that the maximum number of point sym- 
metries for second order ordinary differential equations is 2 + 6 and for higher 
order equations (n > 3) is n + 4. In addition he further showed that an nth 
order equation which possesses n + 4 point symmetries is equivalent to j/™' = 0, 
where ^ n > denotes d n /dx n , under a point transformation 

X = F(x,y) Y = G(x,y). 

For n > 3, X = F(x) which is called a fibre preserving transformation [2]. Lie 
showed that every linearisable second-order ordinary differential equation has 
the form 

y" = p{x, y)y' 3 + q(x, y)y' 2 + r(x, y)y' + s(x, y), (1) 
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where the coefficients p, q, r and s satisfy the conditions A = and B = 0, 
where 

A = 2q xy - 3p xx - r yy - 3p x r + 3p y s + 2q x q - 3r x p - r y q + 6s y p 
B = 2r xy - q xx - 3s yy - 6p x s + q y s + 3q y s - 2r y r - 3s x p + 3s y q. 

It is well known that when a symmetry is used to determine a first integral 
for a differential equation, the symmetry provides an integrating factor for the 
equation and remains as a symmetry of the first integral. A considerable amount 
of work on constructing integrating factors for ordinary differential equations 
has been done including the works by Cheb-Terrab and Roche in 1999 [4] and 
Leach and Bouquet in 2002 [6]. From these works one concludes that there is a 
strong link between the existence of symmetries be they local or nonlocal and 
the intcgrability of the given equation or systems of equations. 

2 First Integrals and Integrating Factors 

For purposes of this discussion we give a definition of a symmetry of a differential 

equation as follows: 

Definition: An nth order ordinary differential equation 

E(x,y,y',...,yW) = 0, (2) 

admits the one-parameter Lie-group of point transformations 

x = X{x,y;e) = x + e£(x,y) + 0(e 2 ) 

V = Y(x,y;e) = y + erj(x,y) + 0(e 2 ), (3) 

with infinitesimal generator 

G = £(ar, y)d x + ij(x, y)d y (4) 

if the condition 



G [n] 



E(x,y,y',...,y^) =0 (5) 



holds, where G<- n > is the nth extension of G needed to transform the derivatives 
in E — given by 



GW = G + ]T „W - E ( J J V U+1) ^- J) fl.co ■ (6) 

Here the indices deonote total differentiation with respect to x. (See Bluman 
and Kumei [3].) 

Definition: In addition a first integral I for an equation of maximal sym- 
metry E = y( n > = is defined as 



I = f(y,y',y',...,y (n -^) 
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where 

dl _ df 

Q>% | £7=0 ^*^ \e = 

This means that if /i(x, y, y', y", ..., y( n_1 )) is an integrating factor then 

f =hE(x,y,y',...,yM) lE =o=0- (8) 

3 Some algebraic properties of equations of max- 
imal symmetry 

According to Lie's classification [1] the real unimodular group sl(2, R) is pro- 
vided by the Lie algebras spanned by the vectors 

d x , xd x + yd y , x 2 d x + 2xyd v . (9) 

or 

d x , xd x + yd y (x 2 -y 2 )d x + 2xyd y . (10) 

As an example, we consider the well-known third order ordinary differential 
equation of maximal symmetry 

y'" = (11) 

which has seven Lie-point symmetries given as 



Gi = 


= 9y, 


G 2 = xd y , G 3 = 


- x 2 d. 


Cr4 - 


= yd y 






G 5 -- 


= d x , 


G 6 = xd x + yd y , 


G 7 



d x + 2xyd y 

with algebra 3A 1: {sl(2, R) S A x }. 

Remark The first three are solution symmetries and provide a basis for the 
solution of (11), that is, {l,a;,x 2 }, and y = c\ + xc 2 + x 2 cz- G4 is the homo- 
geneity symmetry to indicate that the equation is autonomous and G5,Gq,Gy 
form the sl(2, R) subalgebra. 

Proposition 1 The autonomous integrating factors for (11) are given by y and 
y" . Using y" as the integrating factor and integrating the resulting expression, 
y"y"' — 0, once leads to \y" 2 = k, where k is a constant of integration. For 
k = and k =/= we have the sl{2, R) subalgebra and the algebra is sl(3, R) : 
2Ai © s {sl(2, R) © A{\ 2A\. It is also a well-known result that all equations 
of maximal symmetry in the form j/™' = have the sl(2, R) element(see [13]) 
and references therein). Otherwise if y" — k is treated as a function we have 
G\ = d y ,G2 = xd y ,Gs = d Xl G^ = xd x + 2yd y . The algebra in this case is 
A\ g : A 2 ® S 2A 1 . 

Proposition 2 Ifyis an integrating factor of the autonomous equation of max- 
imal symmetry y( n > — then the integral obtained using this integrating factor 
will always have the sl(2,R) subalgebra. 
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• In the case of equation (11) using y as an integrating factor and integrating 
the equation obtained once as before gives yy" — \y' 2 = k. This equation 
has the sl(2, R) subalgebra. Infact the integrated equation can be written 
as (y 1 ' 2 )" — , 1 /2- l 3 which is in the form of the Ermakov-Pinney equation 

[7, 8]. A point transformation v = y 1 / 2 leads to the equation v" = k/v 3 . 
In this case k = retains the second order differential equation and k ^ 
gives the point symmetries G\ = d x , Gi = 2xd x + vd v , G% = x d x + xud u . 

• The Ermakov-Pinney equation possesses the thrcc-clcment algebra of Lie 
point symmetries si (2, R) which is characteristic of all scalar ordinary 
differential equations of maximal symmetry (see [10, 12]). 

Proposition 3 The equation of maximal symmetry in the form j/™' = 
will have as one of its autonomous integrating factors y iff n is odd. 

As an example we can easily verify that y'" = 0,j/ t '* > = and y( m ^ = 
will all have y as one of the integrating factors. In the case where n is 
even y does not appear as one of the integrating factors. 

4 Occurrence of s/(2, R) integrals 

For the purposes of this section we shall call the fundamental integrals as those 
that emanate from the solution symmetries of the differential equation. The 
sl(2,R) integrals will be the integrals that arise as a linear combination of 
the fundamental integrals and possess the sl(2, R) subalgebra. For example, 
the integrating factors for (11) are l,x, \x 2 , from which we observe that there 
are also the coefficients of the solution symmetries of the differential equation. 
Associated with each of these are the integrals shown below: 

y 

(12) 



In Flessas et al [14, 15] the numbering of the integrals is according to the one 
given above. 

We consider the fifth order-ordinary differential equation, 

y v - o, (13) 

with autonomous integrating factors y, y" and y™ for illustrative purposes. Mul- 
tiplying (13) by y and integrating the resulting equation gives 

iv I in i 1/2 r /1 a\ 

yy -yy + ^y = J- (14) 



\x 2 y'" = 


h = \x 2 y" - xy' 


x.y'" = 


h = xy" - y' 


l.y'" = 


- h = y". 
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• The integral in (14) for J ^ has the point symmetries 

Gi = d x (15) 

G 2 = xd x + 2yd y (16) 

G 3 = a: 2 ^ + 4st/0 w . (17) 

In the case that J = we just obtain the four point symmetries which are 
Gi = <9 X , G 2 = xd x ,G 3 = ydj, and G 4 = a; 2 ^ + 4xy<9 y . 

It is noted that the integral obtained using the integrating factor y always 
has sl(2, R) subalgebra according to proposition 2. 

Equation (13) has a basis of solutions l,x,x 2 ,x 3 ,x so that each of these 
give us the solution symmetries, 



Gi = 


= 9x 


G 2 = 


= xdx 


G 3 = 


- x dx 


G 3 = 


- x 3 dx 


G 3 = 


= a; <9x 



Associated with these solution symmetries are the fundamental integrals 



x 2 y" -xy' + y 



x 4 y v 


= 


Jo = hx A y™ - \x 3 y'" + \. 


x 3 y v 


= 


h = h 3 y iv - \x 2 y'" + xy' 
h = \x 2 y m - xy'" + y" 


x 2 y v 


= 


xy" 


= 


h = xy™ - y'" 


iy v 


= 


- h = y iv - 



(18) 



Proposition 4 For the fifth order equation, y" = 0, the autonomous inte- 
gral emanating from the integrating factor y appearing in equation (14) can 
be obtained from the linear combination J = IqIa — I1I3 + -^t 1 where the 
Ii,i — 0, 1,2, 3,4 are as given in (18) above. 

We now consider the case for which n = 9, that is, the equation which was 

mentioned but not treated in [13]. In this case the equation takes the form 

y ix - 0. (19) 

The autonomous integrating factors corresponding to (19) are y, y" , y lv , y vt and 
y vm . If we use y in (19) as the integrating factor and integrating the subsequent 
equation we obtain 

viii I vii , // vi III v i iv j /or^^ 

yy -yy +yy-yy+^y = J- (20) 

The integral in (20) has Lie-point symmetries 
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G x = d x 

G 2 = xd x + Aydy 
G 3 = x 2 d x + 8xyd y 

which form the sl(2, R) subalgebra as expected. The rest of the integrals corre- 
sponding to the other integrating factors are given as follows: 

. n ix ..II viii _ uivii i ,IIH .vi _ l„,f 2 

y y y y y > y y 2 " 

III! ix III! viii _ v vii i l„.vi 2 

y y y y y y < 2" (9-1 \ 

vi ix vi viii _ lyVii 2 *■ ' 

lt viii„ t ix . \ q .viii 

y y 2 " 

Moreover it is important to note that the higher the order the more tedious 
the expressions become for the fundamental integrals hence the need for a more 
compact generating function for the fundamental integrals. In the case of the 
9th order ordinary differential equation we list these just to illustrate the point 
as follows: 



x 8 y lx I = x 8 y viii - 8x 7 y vU + 56x 6 y m - 336x 5 y v + 

I680x 4 y iv - 672CteV" + 20160x 2 y" - 40320a:y' + 40320y 

x 7 y lx h = x 7 y vUl - 7x e y vU + 42x 5 y m - 210x 4 y v + 

8A0x 3 y lv - 2520x 2 y'" + 5040xy" - 5040y' 

x 6 y lx I 2 = x 6 y viii - 6x 5 y vU + 30x 4 y m - 120x 3 y v + 

360x 2 y lv ~ 720xy'" + 720y" 

x 5 y ix I 3 = x 5 y viu - 5x 4 y vU + 20x 3 y m - 60x 2 y v + 

120xy iv - 120y'" 

x 4 y lx h = x 4 y viu - 4x 3 y vU + 12x 2 y m - 2Axy" + 2Ay lv 

x 3 y lx I 5 = x 3 y viu - 3x 2 y ml + 6xy vi - Qy" 

x 2 y ix j g _ x 2 y viii _ 2xy vit + 2y m 

iy lx — » h = y v " 



,'1:1/1/1 



(22) 

This then makes the construction of J using the fundamental integrals a 

daunting exercise as the order of the equation increases. However one can verify 

that the integral J = yy vm — y'y m% + y"y m — y'"y v + \y tv can also be obtained 

as a linear combination of the I^s with i = 0, 1, 2, 3, 4, 5, 6, 7 above. 
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5 Conclusion 

We have discussed some aspects on the integrability of ordinary differential 
equations. The discussion extended to the relationship between the basis set of 
solutions of an nth order ordinary differential equation of order n, its solution 
symmetries and associated fundamental integrals. Of interest was the integral 
constructed from these fundamental integrals. As the order of the equation 
increases, particularly, from order n > 9 the construction of the sl(2, R) integral 
from the fundamental integrals would involve very long expressions and hence 
a need for a construction of a more compact generating function to generate all 
such integrals. Otherwise the same sl(2,R) integral can be obtained by direct 
integration from the original equation after multiplying it by the autonomous 
integrating factor y. We also observe that the sl(2,R) integral only occurs in 
y( n ) = o if n is odd and n > 3. This discussion needs to be extended to all 
other equations of maximal symmetry and not necessarily with the simplest 
case considered here and to extend it to generalised symmetries. 
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Abstract 

A rectangular membrane with fixed boundary conditions under applied 
a transversal force has the solution with singularity. That is the Green's 
function of this problem. But if a string is considered, which could be 
taken as a strip from the above membrane under the same force then 
the Green's function has not a singularity. The MAC solution for the 
membrane will be considered. This solution transforms the above Green's 
function with singularity into the MAC function which does not have a 
singularity. The obtained MAC solution for membrane corresponds to the 
Green's function of a string. 
KEY WORDS: membrane, Laplace equation, MAC solution. 

1 Introduction 

Membranes are considered in biology, chemistry and physics. Information about 
experiments and theories is presented in Internet by the companies Lockheed, 
Volvo, U.S. Army Research Office etc. A number of journals and problems 
concerning membrane theories are presented in references (1) - (12). We can 
conclude that the membrane problem is important and it is under consideration 
of many research groups. 

The Green's function method is an important approach in membrane theory, it 
is widely used in micromechanics and nanomechanics too (4), (9). 
The mathematical aspect of the mechanical membrane is considered in this 
paper. The MAC solution of the stated problem will be considered instead of 
the strong or week solutions (1), which are usually under consideration. It will 
be shown that the strong and the week solutions are not physically acceptable in 
the case when one part of the boundary conditions is given at the point inside the 
domain which a membrane is occupied. The MAC model of the membrane will 
be obtained, which solution is called MAC solution. If the classical equation 
of the membrane under small deformation is a wave equation then the MAC 
equation is an intcgro-differential equation. The conformal mapping is used to 
create the MAC Green's function and the method of superposition is applied to 
create the MAC model. 
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2 Circular elastic membrane under point load 

2.1 Statement of the membrane problem 

The potential energy of the initially plane elastic membrane is 



U = To-( (Jl + u x 2 + u y 2 -l)dxdy); (1) 



where membrane lies in plane (x, y) in its natural state, To is its tension on a unit 
of length, u(x,y) - transversal displacement of the point (x,y) of the initially 
plane membrane, D - domain of the plane (x, y) occupied by a membrane, 
external forces are not applied. Apply the following boundary conditions: 

u\od = 0, (2) 

u(a, b) = uq ^ 0, (3) 

dD is the boundary of the boundary of D without a point (a, 6), which is an 
internal point in D. We will consider the linearized equations of the membrane 
with the elastic potential energy: 



U = 4-( / (u x 2 +u y 2 )dxdy); (4) 



2 J J D 

where it ie supposed that \u x \ << 1 and \u y \ << 1. The membrane equation in 
this case of small strains is the 2D Laplace equation: 

dx 2 ' dy 2 

2.2 Three types of solutions of the membrane problem 

The Laplace equation may have the following three different types of solutions: 

• Strong solution - it satisfies the equation and continuously the boundary 
conditions; 

• Generalized solution of the 1st kind - it is a limit of the sequence of strong 
solutions; 

• Generalized solution of the 2nd kind (MAC solution) - it is a function 
which is connected with Laplace equation and some additional conditions. 
In this case we will create a MAC model of the problem under considera- 
tion. 

Consider these three types of solutions for a circular membrane of radius R. 
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2.3 Strong solution of nonlinear membrane problem 

The potential energy for a nonlinear symmetrical problem is 



U = T -( / (Vl + «r 2 -1)t -dr); (6) 

Then the differential equation for a circular membrane is 

1(^=^=0. (7) 

dr VI + u r 2 

The boundary conditions are 

u(0)=u ,u(R) = 0. (8) 

The general solution of this ordinary differential equation (7) has the form 

u = A- Arch(^-)+B, (9) 

where A, B are arbitrary constants. It follows from the boundary condition 

«(0) = u , (10) 

that a constant A = 0. Then a constant B must satisfy: 

B = u o ^0, (11) 

and 

B = 0. (12) 

So we came to the conclusion that the strong classical nonlinear solution of the 
problem does not exist. 

2.4 Strong solution of linear membrane problem 

Consider now the linearized membrane problem. In this case we have the elastic 
energy in the form 

To / [ [ 2 



U =ir-(J J D u r-r-dr). (13) 

In the axisymmetrical case the differential equation is the Laplace equation, 
which is 

r dr dr 
The boundary conditions are as before in nolinear case Eqs.(8): 

u(0) = mo ^ 0, u(R) = 0. (15) 
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Then the general solution of the Eq.(14) is 

u = A-\n(r)+B; (16) 

A, B - arbitrary constants. 

To satisfy boundary conditions we obtain 

u(0) =u = B,A = 0; u(R) = = B. (17) 

It meens that we have a contradiction as a costant B can not be zero and 
nonzero simultaneously. So we can conclude that the classical symmetrical 
strong solution of linear membrane problem does not exist. 

2.5 Week solution of linear membrane problem 

We will call week solution the generalized solution of the 1st kind which is 
obtained as a limit of the sequence of strong solutions as follows. 
Consider linear membrane equation for a ring: 

0<e<r<R; (18) 

Here e is a small radius of a hole near the origin. The Laplace equation in this 
symmetric case is 

1 d du 

r dr dr 
We apply the boundary conditions 

u(e) = u Q ^ 0, u(R) = 0. (20) 

If e will tend to zero then the stated problem will remind us about the problem 
for a circle Eqs(14),(15). The solution of the given boundary value problem 
Eqs.(18),(19),(20) is 

U= J* j n(r ) (21) 

m(e) 

where the external radius R is taken as R — 1 that does not create any restric- 
tions. If the internal radius 

e^O (22) 

then the generalized solution of the 1st kind for a circle or in our notations is 

u = mo for r = e -> 0, (23) 

u = for < r < 1. (24) 

This solution is not a continuous function of r. This solution we will call as a 
week solution of our linearized membrane problem. 
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2.6 Week solution of nonlinear membrane problem 

It is not too difficult to see that the approach from the preceding subsection, 
applied to the nonlinear membrane problem, does not give any solution at all, 
because the limit when e tends to zero does not exist. 

2.7 MAC solution of the linear membrane problem 

To overcome the problem of nonexistence of solutions of membrane problem 
the MAC model will be introduced. To create the MAC model consider any 
diameter of a cirlular membrane. Then the string along this diameter is taken. 
Consider now the following elastic string problem. The elastic potential energy 
of a string is 



U = T -( (y/l + u x 2 -l)dx); (25) 



We consider a stretched string which lies along the x axis in its natural state 
from x — to x = L, Tq is a tension of a string; u(x) is transversal displacement 
of a point x in plain (x,u); L is the length of a string. 
The boundary conditions of the string are 

u(0) = 0; u(L) = 0; (26) 

u(a) =u ^0, (27) 

where a is an internal point of a string, < a < L. 
The string equation is: 

The solutions of the linear and nonlinear string problems coinsidc. That solution 

is 

x 
u(x) = uq ■ — , < x < a; (29) 

a 

L — x 

u(x) = Mo • j a < x < L. (30) 

1j — a 

We take a — -j in the obtained solution(29),(30) and because of the symmetry 
of the membrane problem the MAC solution of the membrane problem (14), (15) 
is defined as 

u(r) = u Q ■ (1 - r). (31) 

It is easely to see that the introduced MAC solution Eq.(31) corresponds to the 
experiments with membranes. Anybody can check it at home without scientific 
laboratories. The strong solution of this problem does not exist and the week 
solotion Eqs.(23),(24) is a nonsense. 

The MAC solution will be used to obtain the MAC Green's function for rect- 
angular membrane in the next section. 
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3 MAC solution for rectangular membrane 

3.1 Statement of the problem 

To obtain the MAC Green's fuction for rectangle consider the linear membrane 

problem for a rectangle. The rectangle occupies the domain D : —K\ < x < 

K u 0<y<K 2 . 

The linearized equation of the membrane is the Laplace equation Eq.(5) or 

Au = 0. (32) 

The boundary conditions are 

u\ dD = 0, (33) 

u(a,b)=u o ^0, (34) 

where the point (a, b) is an internal point of the rectangle and dD is the bound- 
ary of the rectangle. 



3.2 Conformal mapping 

The rectangle — K < x < K\, < y < K 2 in the complex plane z = x + i ■ y is 
conformal mapped into the circle x\ + y\ < 1 in the complex plain z\ = xi+i-yi: 

_ sn(z; k) - sn(d; k) 
sn(z; k) — sn(d; k) 

where < k < 1 is given constant and d = a + i ■ b = (a,b) is an inner point in 

rectangle; 

sn(z, k) is elliptical sinus; 

K 1 = sn(l;k), (36) 

K 2 = sn{pk)-K 1 . (37) 

The point (a, b) in the z-plane is mapped into the origin in the zi-plane. The 

unknown displacement u will then depend on the independant variables xi,yi. 

We have the following boundary value problem: 

Domain D\ is a circle x\ + y\ < 1. 

The equation is again the Laplace equation Eq.(5): 

d 2 u d 2 u „ . . 

oxi+wr - (38) 



The boundary conditions are 



U\xl+yl = l = ' ( 39 ) 

w(0,0) = w - (40) 
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The problem Eqs.(38),(39),(40) is the same problem which was considered in 
the preceding section. As solution of this problem we take the MAC solution 
Eq.(31). And finally we obtain the MAC solution of the membrane problem in 
the form: 

u(x,y) = u -(l-\zi\), (41) 

where z\ is obtained from conformal mapping of rectangle into the unit circle: 

sn(z; k) - sn(d; k) 

Z\ = , {^2.) 

sn(z; k) — sn(d; k) 

3.3 MAC Green's function 

Transversal force acting at the point d = (a,b) of rectangle is: 

p -L&* (43 » 

The function P(uq, a, b) depends on the point (a, b) and the applied displacement 
uq. If the appied point load P is given then we can solve the equation 

P(u ,a,b) = P (44) 

and find 

wo = u (P, a,b)=P- f(a, b) = P ■ /(d). (45) 

If we put uq from the Eq.(45) into the Eq.(41) then wc obtain the MAC solution 
in the form of the MAC Green's function for a rectangular membrane. So we 
have 

u(x, y) = P- /(d) ■ (1 - M) = P ■ M(z, d), (46) 

where a point z — (x, y) and the MAC Green's function of rectangular membrane 
is 

M{, : A) = f(J) ■ (1 - | Sn( * ; fc) ~ Sn(d; k) |) (47) 

sn(z; k) — sn(d; k) 

4 Integro-differential equation for MAC mem- 
brane model 

Principle of superposition allows to write the new membrane equation in the 
following form of integro-differential equation: 

r r d 2 u 

u(x, y, t) = J J M{z, d) ■ p(d) ■ -^-(d, t) dD, (48) 

where M(z, d) - is the MAC Green's function of the domain D, 

u{x, y, t) is the transversal displacement of the point d of the membrane, 

p(d) - mass-density of the membrane per unit area at a point d, 

t - is time. 
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5 Conclusion 

• The strong solution of nonlinear and linear circular membrane problem 
does not exist, 

• The generalized solution of the 1st kind (week solution) does not exist in 
nonlinear membrane problem, 

• The generalized solution of the 1st kind (week solution) exists in linear 
membrane problem: 

it satisfies the Laplace equation and does not satisfy the boundary condi- 
tions continuously, 

• The generalized solution of the second kind (MAC solution) and MAC 
Green function are introduced as solutions of the MAC membrane model: 
they satisfy the boundary conditions and does not satisfy the Laplace 
equation. 

The obtained integro-diffcrcntial equation is suggested to use in membrane prob- 
lems of more complicated type. 
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The Gibbs phenomenon refers to the lack of uniform convergence which oc- 
curs in many orthogonal basis approximations to piecewise smooth functions. 
This lack of uniform convergence manifests itself in spurious oscillations near 
the points of discontinuity and a low order of convergence away from the dis- 
continuities. In previous work [11,12] we described a numerical procedure for 
overcoming the Gibbs phenomenon called the Inverse Wavelet Reconstruction 
method (IWR). The method takes the Fourier coefficients of an oscillatory par- 
tial sum and uses them to construct the wavelet coefficients of a non-oscillatory 
wavelet series. However, we only described the method standard wavelet se- 
ries and not for the continuous wavelet transform. We also only described the 
method for so-called crude wavelets: wavelets which do not admit a scaling 
function, and therefore lack a multiresolution analysis. Here we describe a 
version of IWR based on a quadrature discretization of the continuous inverse 
wavelet transform. We also describe the multiresolution potential of the Inverse 
wavelet reconstruction method, which is the ability to decompose a function into 
a coarse trend plus finer details. 

Key Words: Gibbs phenomenon, Fourier Series, Inverse polynomial recon- 
struction, Wavelets, Inverse wavelet reconstruction. 

1 Introduction 

Fourier and orthogonal polynomial series are known for their highly accurate 
expansions for smooth functions. In fact it is known that the more deriv- 
atives a function has, the faster the approximation will converge. However, 
when a function possesses jump-discontinuities the approximation will fail to 
converge uniformly. In addition, spurious oscillations will cause a loss of ac- 
curacy throughout the entire domain. This lack of uniform convergence is 
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known as the Gibbs phenomenon. Methods for post-processing approximations 
which suffer from the Gibbs phenomenon include the Gegenbauer reconstruc- 
tion method of Gottlieb and Shu [7, 9], the method of Pade approximants due 
to Driscoll and Fornberg [2] , the method of spectral mollifiers due to Gottlieb 
and Tadmor [8] and Tadmor and Tanner [21, 22], the Inverse polynomial recon- 
struction (IPR) method of Shizgal and Jung [14,15,16,19,20], and the Freund 
polynomial reconstruction method of Gelb and Tanner [6] . These reconstruction 
methods can be combined with an effective method for edge-detection developed 
by Gelb and Tadmor [3, 4, 5], to yield an exponentially accurate reconstruction 
of the original function. In previous work we described a numerical method 
for overcoming the Gibbs phenomenon following the work of Shizgal and Jung, 
called the Inverse Wavelet Reconstruction method (IWR) [10,11,12]. The IWR 
method was shown numerically to perform at least as well as inverse polynomial 
reconstruction. 

However, our main justification for choosing wavelets over other orthogo- 
nal or non-orthogonal bases, lies in ability of wavelets to decompose a function 
into a coarse trend and finer details. This decomposition is essentially what is 
known as a multiresolution analysis. Our previous papers on the IWR method 
focused on so-called crude wavelets: wavelets which do not admit a scaling func- 
tion and which therefore do not offer a multiresolution analysis The present 
paper addresses the notion of multiresolution reconstruction from a Fourier par- 
tial sum. Our previous papers described the method using standard wavelet 
series approximations. The present paper also addresses how the method can 
be implemented using the continuous wavelet transform (CWT). The CWT is 
truncated and discretized using a uniform Trapezoidal rule or a Gauss quadra- 
ture grid and a non-standard wavelet series approximation is obtained. 

We begin with a brief review of the essential definitions of wavelets which 
we will need. Recall that a wavelet is a function i/iei 2 (R) satisfying: 



ip{x)dx = (1) 

J — CO 

and 

"^P«<«. (2) 

where *& here is the Fourier transform of ip. To avoid a conflict of notations we 
will use a capital letter for the Fourier transform and a hat above the letter for a 
Fourier coefficient. The function ip is known as a mother wavelet or an analyzing 
wavelet since any function / G L 2 (Ft) can be expressed as a continuous sum 
of translations and dilations involving tp according to the continuous wavelet 
transform. The continuous wavelet transform (CWT) is given by 



F v ,(6,a) = |a|- 1/2 / f(x)i>[- )dx 
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and the inverse CWT is given by 



i r r f ^(m 



fl?) = 7T / "^VW« ( 3 ) 

W J -oo J-oo a 



where 



and 



V(*)H a r 1/2 ^(V) 

^ = loo T^ 

A discrete wavelet series is given for fixed constants ao, 60 by 

00 00 

f ^ = 2 2 'W^W 0*0 ( 4 ) 

771= — OO / = — OO 

where the discrete wavelet series coefficients are given by 

and r/» m ; (x) in this case has a slightly different meaning: 

^ m ,l 0) = a™ 72 ^ (a™ a; - b l) . 

In general the wavelet functions ip m ; do not constitute an orthonormal basis. 
Instead they constitute what is known as a frame. The family of functions ip m i 

is a frame if 

00 00 

A\\ff< J2 E K/>vw>|<bii/ii 2 

m— — 00 l=— oo 

for positive constants A and B. When A — B the functions ipj k are known as 
a tight frame. For more information see Daubechies [1]. The inverse wavelet 
reconstruction method makes use of wavelet bases and frames, as well as the 
CWT. 

Much of the present paper focuses on wavelets which are known to have what 
is known as a multiresolution analysis. A scaling function, sometimes called a 
father wavelet, is function cj> (x) which satisfies the two-scale relations: 

OO 

<j)(x)= y^ hi<f>(2x-l) 

/ — -co 

and 

CO 

/ — -co 
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The functions </> m l (x) — 2 m / 2 (f) (2 m x — I) are orthonormal to each other and 
they are also orthonormal to tp m { (x) — 2 m / 2 ijj (2 m x — I). The functions <f> and 
ip allow for a decomposition of a function into the sum of a coarse approximation 
and finer details as follows: 

oo oo oo 

f( X ) = Y C Mo,l<l ) Mo,l( X )+ Y Y d ™,l^m,l( X ) 
I — — 00 m—Mo I — — oo 

which is truncated as 

L Af-l L 

f M ,L (x) ~ Y C M ,l<t>M ,l ( x )+ Y Y dm ^rn,l 0) • 
l=-L m=M l = -L 

The Cm ,i are known as approximation coefficients for the function at resolution 
scale Mq and the d m ,i are detail coefficients for the finer scales. This particular 
decomposition of a function into trend and details is known as a multiresolution 
analysis. 

2 The Multiresolution Inverse Wavelet Recon- 
struction Method 

2.1 Inverse Wavelet Reconstruction 

We begin with a Fourier partial sum and assume that the function / (x) can 
also be expressed as a discrete wavelet series: 

oo oo 

/ ( x ) = Y Y dm ^^,i 0) 

m=— oo I — — oo 

where 

/■OO 

d m ,i = I f (x) ip mJ (x)dx 
and 



— oc 



i>m,l ( x ) = a a m/2 ' t P (°Z Lx ~ W) 



We now derive a formula expressing the Fourier coefficients in terms of wavelet 
coefficients. 

f(n) = \j f(x)e-^dx 

1 OO CO 

dx 



2 / Y Y d ™,^rn,l ( X ) e ~ 

oo oo • _. „i ^ 

Y Y d ^ U / ^mA X ) e ~ ilmXdx 

-n=-oo l=-oo ^ 1 J 

CO CO 

Y Y d ™,ii>m,i ( n ) (5) 



m— — oo l — — c 
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The inverse wavelet reconstruction method is obtained by truncating the doubly 
infinite sum described above and solving for the wavelet coefficients. We suggest 
solving the following system of equations. 



M 



f(n)= E E d ™$™,i {n) (6) 



-_-Ml=-L 



for the wavelet coefficients d m ^i where n — —N...N and N — 2ML + M + L. 
One then computes the IWR approximation 



M 



SM,Lf (X) = E E d ™^m,l (X) ■ (7) 

m=-M X——L 

The terms ij) m ; (n) are the nth Fourier coefficients of ip m ; (x) . Since we are 
solving a system of 2N + 1 equations in (2M + 1) (2L + 1) unknown wavelet 
coefficients, in order for the system to be invertible we must have 2N + 1 = 
(2M + 1) (2L + f ), or N = 2ML + M + L. Numerical experiments indicate 
that the invertibility condition can be relaxed and fewer wavelet coefficients can 
be computed if one solves the overdetermined system in a least squares sense 
using Matlab's backslash operator. This reconstruction is valid for both crude 
and non-crude wavelets. 

2.2 Multiresolution IWR: Wavelet and Scaling Function 
Approach 

Here we describe the IWR method for wavelets which allow for a multiresolution 
reconstruction. A formal derivation of the method proceeds again as follows. 
Write 

f(n) = \j_ f(x)e- i7rnx dx 

1 /"l / °° oo oo \ 

= 2 / ( S c M a ,i4> Ma A x )+ E E d m ,i4> m ,i( x )) e ~ l ™ Xdx 

\l — —oo m-Mo l — — oo / 

= E c m„j Q 1^ 4>m ,i (x) e-^dx^j 

oo oo /-I pi \ 

+ E E ^.'(2/ ^ i(l)e "™ IdI 

m=M l=-oo ^ ^~ l ' 

oo 00 00 

= E C Mod4>M„,l (n) + E E Cm ^rn,l ( n ) (8) 



m-Mo / — — oo 
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The IWR method in this case is obtained by truncating the sums and solving 
the system of equations for the approximation and detail coefficients 

L M-l L 

f(n) = E Cm o^m ,i (n) + E E d m ^ myl (n) (9) 

l=-L m=M l = -L 

where n = —N...N . Numerical results again indicate that the system can be 
solved in a least squares sense and one need not be concerned with meeting an 
invertibility condition exactly. One then constructs the multiresolution IWR 
approximation 

S$ L f(x)=A™°(x) + D$ L (x) 

where the coarse trend is given by 

L 

A L° ( x ) = E C *W^M ,Z ( X ) 
l = -L 

and the finer details are given by 

M-l L 

D^ L (*) = E E d ™^m,i (x) ■ 

m=M l=—L 

2.3 Multiresolution IWR: Pure Wavelet Approach 

Next we describe an alternative multiresolution reconstruction for non-crude 
wavelets which does not make explicit use of the scaling function <fi. It is 
simply a modification of the original IWR approximation. We write 

S$ L f(x) = A$ L (x) + D$ L (x) 

where the coarse trend is given by 

M -l L 

A M°,L ( X ) = E E d m,l^,n,l 0) 
m=-M l=-L 

and the finer details are given by 

M L 

K°,L (X) = E E d ™^rn,l (3) ■ 

m=M a l=-L 

The derivation is simply to begin with the original IWR reconstruction and 
write 

M L 



s M xf{x) = E E dm < ; ^w( x ) 



m=-M l = -L 
M -l L ML 

E E d ™^™,l {X) + E E d ™^rn,l ( a 

m=-M l = -L m=M a l=-L 

A m",L ( x ) + -^M°L ( x ) 
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The Pure Wavelet Approach will only provide a good decomposition into trend 
and details for wavelets which admit a scaling function, even though the method 
does not make explicit use of the scaling function. This is because the two 
approaches are approximately equivalent, that is: 

A L ° (x) ~ A M a L (x) . 

In fact, 

L 

Al° {*) = E C Mo,l<f>M ,l (X) 
l=-L 

but also 

Afo-l L 

A L° ( x ) = E E d ™^m,l ( x ) 

m= — oo l = — L 

-M-l L M -l L 

E E d ™^rn,l ( X ) + E E dm ^rnU 0) 
m= — oo i = — L m= — Ml=—L 

= A- L M (x) + A^ L (x). 
Since A^ (a;) is the tail of a convergent series, A^ (x) — » as A'/ — > oo since 

3 A Version of IWR Based on the Continuous 
Wavelet Transform 

Finally, we describe here a related method for implementing IWR which is based 
on a quadrature discretization of the inverse continuous wavelet transform. It 
produces results which are comparable to the standard wavelet series approach 
described previously. Our chief reason for considering it is allows us to consider 
any wavelet family as a candidate for use in reconstruction. 
The derivation is as follows. 

1 ^ 



f(n) = zj_ f(x)e-™ nx dx 



-l \W J- 



oo «/ — oo 



*£M^H 



i r rfKMUi /■ j^\ e -™. to )(toa , 



1 ify(M) 



OO /*00 



-oo ^V 7 a 



V M (n) dadfc. (10) 



Note that in this section we use ?/>& a (n) to denote the nth Fourier coefficient of 
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Truncate the limits of integration and write 

/(»)~ r [ A ±^M.x {n)dadb . (ii) 



-A Cj/> a 2 

Next, discretizc the integrations using a quadrature rule such as the trapezoidal 
rule or Gauss quadrature. The resulting equations become 

^W^EE^^mW ( 12 ) 

which can be expressed in simpler form by letting d m j = J — * ^'""'■ , and 
writing 

M L 

/w-EEMmW' ( l3 ) 

m=0 1=0 

Finally, solve the system of equations, possibly in a least squares sense, for the 
coefficients d m ,i and construct the nonstandard wavelet series approximation: 

ML / _ J, \ 

/w^EE^hr 1 - (14) 

4 Numerical Results 

Numerical experiments show that the IWR method based on a quadrature 
discretization of the CWT yields uniformly converging approximations for all 
wavelet families we have tested. We display the results of the Haar, Shan- 
non, and Mexican hat wavelets in the figures below. The Haar and Shannon 
systems are some of the earliest examples of what later became wavelets. The 
Haar wavelet is apparently able to resolve an interior discontinuity without 
prior knowledge as to its location, as well as the endpoints. Therefore the Haar 
wavelet is able to provide a global reconstruction, albeit a low order accuracy 
one. The low order of convergence is due to the fact that the wavelet is not 
well-localized in Fourier space. Shannon wavelets will resolve the Gibbs phe- 
nomenon at the endpoints of the interval and Mexican hat wavelets will do it 
even better. 

Numerical experiments also show that the Multiresolution IWR method is 
able to decompose a function into a course trend plus finer details, given a func- 
tion's Fourier coefficients. We provide numerical examples using the both the 
Pure Wavelet Approach to reconstruction and the Wavelet and Scaling function 
Approach described above. We implemented the inverse wavelet reconstruction 
method by computing Fourier coefficients for the test functions in question using 
trapezoidal rule quadrature. This makes our Fourier series a pseudospectral 
series. For the method to be accurate we compute the coefficients tp m l (n) 
using the same trapezoidal rule quadrature. 
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Incise HaarV\fe\elet Ffeconstruction 





Figure 1: Inverse Haar Reconstruction of - tan 1 x based on 64 Fourier coeffi- 
cients using the wavelet series approach. 



362 



GREENE: WAVELET RECONSTRUCTION 



Intense Shannon V\ta\£let Fteconstructi on 




I 10 




Figure 2: Inverse Shannon reconstruction of - tan -1 x using the CWT approach 
from 128 Fourier coefficients. The limits of integration are A = B = 20 and 64 
Gauss-Legendre quadrature nodes in a and b are employed. 
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Figure 3: Converge of Inverse Shannon reconstruction of - tan -1 x using the 
CWT approach from 16, 32, 64, and 128 Fourier coefficients. The limits of 
integration are A = B = 20 and the number of Gauss-Legendre quadrature 
nodes in a and b are equal to the number of Fourier coefficients. 
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ln\eise hbarV\fevelet Reconstmction 



Absolute Enor 





0.5 



Figure 4: Inverse Haar reconstruction of sin(cos(4x)) sgn(x) utlilizing the CWT 
approach with A = B = 20, and Gauss-Legendre quadrature with 32 quadrature 
nodes in a and 32 quadrature nodes in b. The method resolves both endpoint 
and interior discontinuties, albeit with low accuracy. 
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lii£ra3l-fea"V\&detFtenstnjctcrt ©Kjbtffiderts 
1.5 





Figure 5: Inverse Haar wavelet reconstruction using the pure wavelet approach 
of the function / (x) = - tan -1 x + i sin30x based on 64 Fourier coefficients. 



We consider the test function / (x) = ^ tan _1 (x) + ^ sin (30x) to observe 
the ability of various wavelets to extract the trend, mostly due to tan _1 (x), and 
smaller scale oscillations due to ^ sin (30x) . We display here the results of 
the Haar wavelet: the simplest wavelet which admits a multiresolution analysis. 
We also consider the piecewise linear Franklin wavelets generated by the second 
order splines. 



5 Conclusions 

Numerical results indicate that the IWR method based on the continuous wavelet 
transform works as well as the IWR method based on a standard wavelet series. 
It yields an accurate and rapidly converging approximation. The multireso- 
lution IWR methods allows one to separate out from the Fourier coefficients a 
course trend and finer details regardless of any Gibbs phenomenon. Work in 
progress includes a numerical study of the multiresolution IWR method for a 
wider variety of wavelets, multiresolution reconstruction from series other than 
Fourier series, a two-dimensional implementation and a proof of convergence. 
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Haar Trend for 64 Coefficients 







Figure 6: Inverse Haar decomposition of / (x) — — tan 1 •"£ + ttj sin 30x into a 
trend and details using the pure wavelet approach. 
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Figure 7: Inverse Franklin decomposition of / (x) — — tan l x + y^ sin 30a; into 
a trend and details. 



GREENE: WAVELET RECONSTRUCTION 369 



[2] T. A. Driscoll, and B. Fornberg, A Pade-based Algorithm for Overcoming 
the Gibbs Phenomenon, Numerical Algorithms, 26, 2001, pp. 77-92. 

[3] A. Gelb, and E. Tadmor, Detection of Edges in Spectral Data, Appl. Comp. 
Harmonic Anal., 7, 1999, pp. 101-135. 

[4] A. Gelb, and E. Tadmor, Detection of Edges in Spectral Data II. Nonlinear 
Enhancement, SIAM J. Numer. Anal, Vol. 38, No. 4, 2000, pp. 1389-1408. 

[5] A. Gelb, and E. Tadmor, Spectral Reconstruction of Piecewise Smooth 
Functions from their Discrete Data, Mathematical Modeling and Numerical 
Analysis, 36:2, 2002, pp. 155-175. 

[6] A. Gelb, and J. Tanner, Robust Reprojection Methods for the Resolution of 
the Gibbs Phenomenon., Applied Computational and Harmonic Analysis, 
Vol. 20, 1, 2006, pp. 3-25. 

[7] D. Gottlieb, C.-W. Shu, A. Solomonoff, and H. Vandeven, On the Gibbs's 
Phenomenon I: Recovering Exponential Accuracy from the Fourier Partial 
Sum of a Nonperiodic Analytic Function, J. Comput. Appl. Math., 43, 1992, 
pp. 81-92 . 

[8] D. Gottlieb, and E. Tadmor, Recovering Pointwise Values of Discontinu- 
ous Data within Spectral Accuracy, in: Progress and Supercomputing in 
Computational Fluid Dynamics, Proceedings of a 1984 U.S. -Israel Work- 
shop, Progress in Scientific Computing, Vol. 6 (E. M. Murman and S. S. 
Abarbanel eds.), Birkhauser, Boston, 1985, pp. 357-375. 

[9] D. Gottlieb and C.-W. Shu, On the Gibbs Phenomenon and its Resolution, 
SIAM Rev., Vol. 39, 1997, pp. 644-668. 

[10] N. Greene, On the Recovery of Piecewise Smooth Functions from their Inte- 
gral Transforms and Spectral Data, Ph.D. Dissertation, SUNY Stony Brook, 
2004. 

[11] N. Greene, A Wavelet-based Method for Overcoming the Gibbs Phenom- 
enon, in: Recent Advances on Applied Mathematics: Proceedings of the 
American Conference on Applied Mathematics, Cambridge, Massachusetts, 
March 24-26, 2008, pp. 408-412. 

[12] N. Greene, Inverse Wavelet Reconstruction for Resolving the Gibbs Phe- 
nomenon, International Journal of Circuits, Systems and Signal Processing, 
Issue 2, Vol. 2, 2008, pp. 73-77. 

[13] J.-H. Jung, and B. D. Shizgal, Generalization of the Inverse Polynomial 
Reconstruction Method in the Resolution of the Gibbs Phenomenon, J. 
Comp. Appl. Math., v. 172, n.l, 2004, pp.131-151. 



370 GREENE: WAVELET RECONSTRUCTION 



[14] J.-H. Jung, and B. D. Shizgal, Inverse polynomial reconstruction of two 
dimensional Fourier images, J. Scientific Computing, v. 25, n.3, 2005, pp. 
367-399. 

[15] J.-H. Jung, and B. D. Shizgal, On the numerical convergence with the 
inverse polynomial reconstruction method for the resolution of the Gibbs 
phenomenon, J. Computational Physics, 224, 2007, pp. 477-488. 

[16] B. D. Shizgal and J.-H. Jung, Towards the resolution of the Gibbs Phe- 
nomena, J. Comput. Appl. Math., 16, 2003, pp. 41-65. 

[17] R. Pasquetti, On Inverse methods for the Resolution of the Gibbs Phenom- 
enon, Journal of Computational and Applied Mathematics, Vol. 170, no. 2, 
2004, pp. 305-315. 

[18] E. Tadmor and J. Tanner, Adaptive Mollificrs - High Resolution Recovery 
of Piecewise Smooth Data from its Spectral Information, J. Foundations of 
Comp. Math. 2, 2002, 155-189. 

[19] E. Tadmor and J. Tanner, Adaptive Filters for Piecewise Smooth Spectral 
Data, IMA J. Numerical Anal, Vol. 25, 4, 2005, pp. 635-647. 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS, VOL.8, NO.2, 371-379, 2010, COPYRIGHT 2010 EUDOXUS PRESS, LLC 

THE CALCULATION OF AXISYMMETRIC DUCT GEOMETRIES FOR 

INCOMPRESSIBLE,ROTATIONAL FLOW USING AN INTEGRAL 

FORMULA BASED ON GREEN'S THEOREM. 

Vasos Pavlika 

Harrow School of Computer Science 
University of Westminster, School of Computer Science, Harrow, Middlesex, UK 

V. L . Pavlika@westminster .ac.uk 



Key Words: Rotational incompressible flow, Green's function and Integral Formula. 



ABSTRACT. 



In this paper a numerical algorithm is described for 
solving the boundary value problem associated with 
axisymmetric, inviscid, incompressible, rotational to 
obtain duct wall geometries from prescribed wall 
velocity distributions. The governing equations are 
formulated in terms of the stream function iff (x,y) and 

the function (p (x,y) as independent variables where for 
irrotational flow <p (x,y) can be recognized as the 
velocity potential function, for rotational flow (p (x,y) 
ceases being the velocity potential function but does 
remain orthogonal to the stream lines. The numerical 
method uses an integral formula on a uniform mesh. The 
technique described is capable of tackling the so-called 
inverse problem where the velocity wall distributions are 
prescribed from which the duct wall shape is calculated, 
as well as the direct problem where the velocity 
distribution on the duct walls are calculated from 
prescribed duct wall shapes. Numerical results for the 
case of the Dirichlet boundary conditions are given. 

1. INTRODUCTION 

Designers of ducts require numerical techniques for 
calculating wall shapes from a prescribed velocity 
distribution. The objective of the prescribed velocity is 
typically to avoid boundary layer separation. At inlet a 
velocity is prescribed to allow for a vorticity to be 
present calculated from ffl = VAV where the A 
denotes the usual cross product of vectors, (O is the 
vorticity vector and V the velocity vector respectively. 

The present paper describes a numerical algorithm for 
solving the boundary value problem that arises when the 
independent variables are (p and lp~, where (p maybe 
identified as the velocity potential function (for 
irrotational flow only), for flow with vorticity, <p ceases 

being the velocity potential function but does remain 
orthogonal to ^ which may be identified as the stream 
function. The dependent variable y, is the radial 
coordinate and x is the axial coordinate. The numerical 
technique is based on an integral formula using finite 
difference on a uniform mesh. 



2. THE DESIGN PLANE. 

As shown in Pavlika [4] when the independent variables 
are <p(x,y) and y/(x,y) where the <p(x,y) and 

ip~(x 7 y) have been previously defined it can be shown 

that the governing partial differential equation that the 
radius satisfies is given by: 



8 



A dy 



+ - 



d 



B dy 



d(p\B dq> ) dy/yAdy/ 
with the speed calculated from 



= 



1 



J_ 

~A 2 



f 



dy_ 



+ - 






dq> 



(1) 



(2) 



\ U VJ 

and completion of the physical coordinates are provided 
from 

dx = dcp dip' 

A dif B dcp 

where x is the axial coordinate and A and B satisfy their 
own first order quasi-linear hyperbolic partial 
differential equations with characteristics parallel to the 
<p and ip~ axes which maps the physical flow field into 

an infinite strip in the (<p , iff) plane. In fact the A and B 

d 



satisfy: 



and 



(\og(A)) = ^B 
o<p q 

jL(\o g (B)) = -^fA 
dip- q 



(3) 



(4) 



Regarding temporarily r), co a and q as known functions 
of <p and iff the system (3) and (4) as previously 

mentioned is quasi-linear hyperbolic with characteristics 
parallel to the iff and ip~ axes which maps the physical 

flow field into an infinite strip in the (<p, xp~) plane. 
Bearing in mind the freedom available in the stream wise 
variation of <p and the cross stream variation of iff , 

suitable values of A can be prescribed along one <p 
characteristic and those of B can be prescribed along one 
iff characteristic. 

3. The NUMERICAL ALGORITHM IN 
THE DESIGN PLANE. 

Rewriting the partial differential equation that y satisfies 
i.e. Eq. (1) as: 
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d ( c dy_ 
d<j> 



d<j> 



^ d 

+ — 

dif/ 

A 



1 dy 
C dif/ 



= c 



(5) 



where C = — , for problems posed in the design plane 

c=0, the value of C will vary depending on whether the 
flow field is irrotational or swirl free etc. Eq. (5) will be 
re-written as a Poisson equation that is as: 



, c 1 6 2 v 

C C 2 dif/ 2 



f 



dq> 



log e I C 



i a 



C dif/ 






dy 
dif/ 



is that v(<p, iff) = on C in addition to v(<p, iff) being 

harmonic and for the Neumann conditions on t, the 

dv 
requirement is that — = and v(<p, i/f) once again 
dn 

satisfying Laplace's equation. Much literature is 

available for the Green's function for the Laplace 

equation (see Williams [6]) and need not be mentioned 

here. Hence for the Dirichlet problem without loss of 

dy generality setting F((p, Iff) = 1 \f <p, if/ and. for interior 

d(p points: 

(6) 2rt(<p ,%) = ^t^ds-$v D g(—,—)d<pdy/ 



where V is the usual Laplacian operator in rectangular 

2 / d 2 y dy dy 

coordinates so that V y = ?( — , , , C , c) 

dl/f o<p dl/f 

where g is a function of the arguments shown as defined 
by expression (6). 

4. SOLUTION BASED ON AN 
INTEGRAL FORMULA USING 
GREEN'S THEOREM. 

Here the method of solution is derived using an integral 
formula. Commencing with the generalized form of 
Green's theorem for the self adjoint elliptic operator E(t) 
in normal form given by: 

f f vE(t) - tE (A) (v)d<pdq/ = $t — -v — ds 
J - ' ' i dn dn 

where t = y 2 , E(u) = E (A) (u) where E (A) (u) is the 

adjoint of E and v is the fundamental solution to the 
adjoint equation. In this case the adjoint equation is 

given by V V = and E(t) = g as defined by Eq. (6). 
The contour C bounding the surface R is traversed in the 
counter clockwise sense. For a doubly connected region 
introducing a singularity at the point (<p Q , l/f ) (inside or 
on the contour C) and assuming 
v(<p, i/f) = F{q>, i/f) log e | r | so that the distance r is 

given by: r = ((<p - % f +{W~ Vo f ) 

with F((p, iff) analytic, then it can be shown that 



mnt(<p ,if/ )F(<p ,i// ) = j> 



dv 



t- 



■lfvg( 



dn 
d l t d 2 L 



c 

}2* ^2 ] 



dt , 
v — ds 
dn 



)d<pdi//, m = l,2 



dq? dq> 2 

with L = log e t . Now m = 2 if (q> ,if/ ) is within C 
and m=l if (<p Q ,l// ) is on C (the m=l case can be 
shown using the appropriate Plemelj formulae or by 
indenting the contour at (<p Q ,l/f )). For the Dirichlet 
case of boundary condition of t {<p, if/) the requirement 



(J) 
i.e., the Green's function v D satisfies Laplace's 

equation on <5Q where <5Q is defined by: 

% < q>< qfa +1 , l// < \f/< \f/ N+l and vanishes on C. For 

the Neumann problem 

2xt{%,n) = -6v N —ds- \\v N g(— -,— -^(pd// 
J c on j jj d(p oq> 

Which give integral formulae for the square of the radius 
t, from which the radius y can be determined., above 
Green's function v N satisfies the Laplace equation on 

dv 
dQ, with — — vanishing on C. Knowledge of the 
dn 

dt dt 

derivatives and are also required for the 

dq> di/f 

determination of the speed q given by Eq. (2) hence 
differentiating under the integral sign above with respect 

dt 

to <p and i/f gives integral formulae for both and 

dq> 

dt 

, such that: 

dif/ 

_ d , . f dt dv n d 2 v n , 

2n — t(<fi,t/Y ) = <J) — .—2-+t — -ds 

^< if " J c di// dn dxpdn 



di// 



d 



-JK£ 



dif/ 

and similarly for 

2x— K%,%)- 
dq> 



fit $L 



A 



dv D tft &L^ 



dq) ' dq} J dif/ dq} ' dq} ' 
dt 
dq> 



-.&>- 



dt dv n &v 



r D 



\dq> dn 



+t- 



D 



dqdn 



ds 



-h 



d 



( 



g( 



d 2 t d 2 L^ 



dq>\ dq} ' dq} 



dv D fit d 2 L 
+-^.g(—T,—j)dqxIy/ 
dq) dqf dqf 
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5. ITERATIVE SOLUTION. 

To convert formula (7) to a system of linear algebraic 
equations the point t(<p ,lf/ ) inside C is related to its 
boundary values on C. To obtain the first 
iterates t. (q> , i/f Q ) , g ; is set equal to zero, so that 

2N+2M+4 f a,, \ 

2*e= y ' ° 



dn 



t j As j i = 0,l,2,...2N + 2M+4 



Using the trapezoidal rule 

2N+2M+4 1 / -v, \ 

i = 0,l,2,. ..2N + 2M+4 

2N+2M+4 



s M~ s r 



-.)' 



■ f«= ^ ^(v D ,5) ^.,i=0,l,2,...2N + 2M+4 






Wherer(v D ,,)=--^(, j+1 - Vl ) 



4;r 



<5« 



Using this method there is a simple self-consistency 
check, i.e. the tj are known upstream and downstream for 
j=0,U,. ■ . ,N+1 and j=N+M+3,N+M+4,. . . .2N+M+3, 
hence the first iteration may be written as: 



\-K, T ^ -K, 



*-N+2 



N+l 



-K, 



N+2 



l-K ATl , -K 7 



*-JV+3 



'AT+4 



-K-, 



N+2 



-K, 



■2N+2M+4 



-K, 



■2N+2M+4 



^2N+2M+4 ^2N+2M+4 



i 



N+2 



l N+3 






Yfh 



,<h m A«t m = b 



(0 



(S) 



where the summations on the right hand side are 
performed over j =0,1,... N+l and j=M+N+3, 
M+N+4...,2M+N+3. Once the first iterate tj has been 
calculated the field integral containing g is then 
computed, where the central difference approximation to 
the second derivative is used, this is then introduced into 
the right hand side of Eq. (8) and compute the second 

(2) 

iterate, t The procedure is repeated until some 



,(*■) 



■t 



k-l 



,<£. 



convergence criteria is satisfied e.g. || t t , : M 

where e is a constant and the p denotes the p-norm (p=l, 
2 or co). 



6. AXISYMMETRIC FLOW IN THE 
ABSENCE OF BODY FORCES. 

Here numerical solutions to inviscid axisymmetric flow 
with constant vorticity and a swirl velocity will be 
derived. The axial velocity component u x (y) at inlet will 
be chosen to be of the form u x (y) =oy+jB, where ctand /3 
are constants chosen such that u x (yi)=U] and u x (y 2 )=u 2 
where yi represents the inner radius and y 2 the outer 
radius at inlet. The swirl velocity u a (y), will be of the 

form U a (y)= ky + — where the k and 1 are constants 

y 

with ky representing solid body rotation and l/y the so- 
called free vortex term respectively. 



7. THE FLOW EQUATIONS IN THE 
PHYSICAL PLANE (y,a,x). 

Adopting cylindrical polar coordinates with y being the 
radial coordinate, a the circumferential and x the axial 
coordinate, defining velocity components u y ,u a and u x 
with corresponding vorticity components m y , co a , m x in 
the direction of increasing y, a and x respectively, then 
the equation of motion with unit density becomes: 



Du 
Dt 



= -V.p 



D 



(9) 



Where is the material derivative. Eq. (9) can be 

Dt 
written using well known vector identities as: 



dt 
du 



- + M„ 



dx 



- + u„ 



dy y 



dp 
8y 



.^ = o 



y 

dp 
dx 



(10) 



du a du a 

— +u x — +u y — 
dt dx dy 

du r du r du r 

+ W x + U y 

dt dx dy 

Furthermore —= + (w.V)w = -V.p 
dt — - 

can be written (once again using an appropriate vector 

identity as) 

-= + (a a u) = -V( p + - q 1 ) . Thus 

dt - 2 

for steady flow Crocco's form of the equation of motion 
is obtained, i.e. 



(HAffl) = VH 



(11) 
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where H is the total head defined by H = p H — a . 

2 

Calculating the cross product on the left hand side of 

Eq. (11), gives 

dH 



= u a (o x -u x (o a 



(12) 



dy 
= 

dH 

dx 

In addition for axisymmetric flow the vorticity, 0) 
becomes 

du„\ + \du y duA 



Q = U x (D y -U y 6) x 



= U y (0 a- U a (0 , 



CO = V AW = 



dx 



[ dx dy 



-a 



\y fy J 

and the equation of continuity is given by 



(13) 



V.u = 



d(yu x ) | d(yu y ) 
dx dy 



= 



8. THE DESIGN PLANE 
COUNTERPARTS. 

In order to compute numerical solutions in the design 
plane, expressions are required for the terms A, B 
andfi) , thus 



du r du 
dx dy y 



1 ( Q y ^ d /, , xx 

u x — + u\ = -q — (log(y)) 



v 



or 7 : 



2 a 



9 



5 e^ 



? 2 d 



(log(y)), 



but 7/ = ^- — (log(^)) 
5 dq> 

dy/ yq 

thus j4_y = / (iff) , that is = . The arbitrary 

dn f(y/) 

function /^ represents the freedom in the cross stream 

distribution of i// and choosing f(y) to be unity 

everywhere \y can be identified as the usual Stokes 

stream function given by 

dw dw 

— = -yu y ; — = yu x 

dx dy 



Eq. (13), (circumferential component) gives 

{Wo 

dy 



= u S{yU ^ + u d{yU ^ 

x dx y 



Referring to the meridional plane figure 1, it may be 
deduced that 



u, = i 



dx __ dy 
ds y ds 



ds 



(yu a ) = 0^> yu a =C(ys) 



where q 



In terms of C(y) the vorticity vector 




dy 



•a 



y dy\ 

= s> y + co a a + co x x ,by definition. 

An expression for co a is required as this appears in the 
expression for B, so using the radial component of Eq. 
(12) gives 



ft> = 



r l_dC_^ 

y &y 



1 dH 

u x dy 



using the Stokes' stream function this becomes 



G)„ = 



C(y) 



y 



dC 



\ 



v- 



y- 



dH 



dy/ ) dy/ 



which is the required expression to be used in 
calculation of B according to definition (4). If far 
upstream the flow is assumed to be cylindrical so that all 
quantities are independent of x, then with unit density 
the equation of motion and the Stokes' Stream function 
give 

n . d P n. d P W «. 5 y n . d V 1H , 

u y =°;^r = ;^r = — \^r = u;^— = yu x 
dx dy y dx dy 



giving 



ft> = 



C(W) 



y 



dC 



y 

y d 



dy/ J 2 dif/ 



(»x+««)- 



»J 



I 



With u x (y) = ay + /3 and u a (y) = ky-\ — as 



y 



dH 



previously defined. Once has been calculated 

dy/ 

upstream it takes this value throughout the (<p, y/) since 

as is self evident the expression is independent of (p . 

This last expression for m a is required in the calculation 
of B and numerical coupling with Eq. (1) gives the 
numerical solution in the design plane. 

9. DOWNSTREAM CONDITIONS. 

Downstream a cylindrical flow condition as discussed 
below will be prescribed. Defining the pressure function 
H(y/) and the function C(iff) as 

H(V) = -(u 2 x +u 2 a ) + ^- and C{y/) = yu a 
2 p 
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for cylindrical flow radial equilibrium (from Eq. (10) 
radial component gives 



K = 2( ) + (u ) mner ~ (u )„ 



1 dp _u A t 

p dy y 



, integrating gives 



l r , f «« , f C z (yr) 

-(.P-Py- m r)= J — <fy = J T^ 



y 



TTf \ * ( 2 , 2\, P y -inner 

H(W) = -("x+"J + — 



J. 

2 



+ 



J 



7 



y-iwier 



y 



Now J £!M^ = _I Jc 2 J(l/y 2 ) 



y-- inner J 

2 //^2 \ 



y-inner 



7 






/ y-inner 



+ 



1 6?C 2 



^ I 7^r* 



y -inner ■ 



TTf \ 2 , ^ y -inner . *■ s 2\ 

•H(yf) = -",+— + -("J ri 

Z P 1 



y -inner 



+ 



J —^ dl f 



w=a y dl v 

Suppose w xl =w xl ((^)and K a>1 = u al (<y), where 
the subscript 1 denotes upstream conditions, then 
W l2 = u xi (V0 an d M a2 = u ai (V) are required as 
functions of y/ , where the subscript 2 similarly denoting 
downstream conditions, so that 

2 

; 2 4 ) 



P 2 



l r 1 dC , 
-- — rfy/ 

2^U <*V 



and 



f -^^^=-(y 2 2 ->' 2 2 , I „„ er ) 



^=0 M x,2 



thus C(l//) = y Y U a 1 = y 2 U a 2 ,and (14) now gives 



1 2 _ 1 2 . Planner Pz,ii 



r<, X f 2. ry X.,1 



- + 



1 2 2 1 f f 1 O 2 



^ov-^i ^2 J 




where 



and^ 2 2 =y 2 2 , Mer +2 J 



c/y 



(16) 



<^=0 U r.,2 



with m i2 in this case given by Eq. (15). 

10. CALCULATION PROCEDURE. 

The calculation of the downstream radii y2(f f ) follow 
from Eq. (16) with u Xr2 given by Eq. (15), which can be 
written as 



x,2 



= g(ty)+K , where 



(17) 



In order to calculate the (n+l) th iterate it is known that: 

— f 2 



) = 2j 



y=0 



dK 



dij/ 






but 



6K 



(y! 



(n) 



(V ) ( " +1) -(v 2 ) (n) 



K 



(n+l) 



-K 



(«) 



(18) 



from which as can be seen from Eq. (1 8) the K}"' must be 
calculated iteratively with K< 0) =0, Once the K^ n+i> has 
been calculated it is introduced into Eq. (17), giving rise 

to a new (u x 2 ) n which in turn gives a new 

(_y x 2 ) n from Eq. (16) and the process repeated until 
some convergence criteria is satisfied. 

11. PRESCRIPTION OF WALL 
GEOMETRIES. 

In this paper the Dirichlet boundary conditions will be 
prescribed on the wall boundaries so that it is the radii 
values, y that are given as a function of <p on the 

boundaries. The function chosen to give a y distribution 
is based on the hyperbolic tangent, choosing 
y(<p)=Ctanh(a<p+b)+k where C, a, b and k are 

constants, applying the conditions that y=y u at <p = 

andy^a- at (p = O taking aCP+6=3(arbitrary) and b=-3, 

so that tanh(a&+b) ml and tanh(b) &-1, then it follows 
that 



y(<p) = 



1 ' y d -y^ 



tanh(a#>+&) + 



f y^y^ 



(19) 



replacing <p by x in Eq. (19) gives a y(x) distribution. 

The inner radius is prescribed to be equal to unity in this 
paper (arbitrary). The geometries produced are shown in 
figures 2, 3 and 4 respectively 
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12. CONCLUSIONS. 



As shown, geometries have been produced subject to 
given upstream and downstream conditions with 
prescribed Dirichlet boundary conditions. In this case 
vorticity at inlet has been specified by defining the axial 
velocity to be of the form u x (y) =ay+j3, and the swirl 

velocity of the form U a (y)= ky + — , where the k and 

y 

I are constants, defining the so-called free and forced 
vortex whirl respectively. The downstream conditions 
where such that: cylindrical flow was present, Dirichlet 
boundary conditions were prescribed, however the case 
with Neumann conditions can be accommodated using 
the algorithm, in addition so can the case with Robin 
boundary condition. Further examples of the algorithm 
with a combination of boundary condition is given in 
Pavlika (5). Additional work has been carried out by the 
author in which the inlet (and consequently exit) axial 
velocity profile is of parabolic type modeling effectively 
viscous flow as shown in Pavlika (5). Furthermore 
Hagen-Poiseuille flow has also been modeled using the 
techniques described in this paper. In addition to the 
techniques described above the author has also looked at 
the cases when body forces exist as well as blockage 
effects created by a circumferentially arranged cascade of 
aerofoils. It was found that at most twenty iterations 
were required to achieve an acceptable level of 
convergence when these additional features were 
incorporated into the model. 
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14. FIGURES. 




Figure 1. The meridional plane. 
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Figure 2. The geometry and speed distribution (along the top boundary) produced given a Swirl velocity 
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Figure 3. The geometry and speed distribution (along the top boundary) produced given a Swirl velocity 
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Abstract 

The paper describes a method of segmented wavelet transform (SegWT) 
that makes it possible to compute the discrete-time wavelet transform of 
a signal segment-by-segment, with exactly the same result as if the whole 
signal were transformed at once. Due to its generality, the method can 
be utilized in many situations: for wavelet-type processing of a signal in 
real time or in case we want to process the signal in parallel or in case 
we need to process a long signal, but the available memory capacity is 
insufficient (e.g. in the DSPs). In the paper, the background theory and 
the emerging principles of both the forward and the inverse SegWT are 
explained. 

Keywords segmented discrete-time wavelet transform, real-time wavelet transform, 
SegWT, signal processing, algorithm 

1 Introduction 

The discrete-time wavelet transform (DTWT) has many applications in the field 
of signal and image processing today. Most of them require signals to be com- 
pletely known when the processing is initiated. The natural need for real-time 
applications originated in the development of methods allowing the computa- 
tion of the DTWT without knowing the signal in advance, and possibly with 
minimum delay at the same time. Applications arising in the image processing 
field such as wavelet compression segment-by-segment (e.g. JPEG2000 coding 
for large images) lead to the same category of methods. 

We are not interested in algorithms for transforming data received as a 
continuous stream [4] (e.g. in the FPGAs), because we suppose the data come in 
the form of nonoverlapping segments. Leaving these algorithms aside, methods 



394 



Algorithms for Segmentwise Computation of . . . Wavelet Transform 




Figure 1: The undesirable artefacts near the segments borders. The image was 
(strongly) compressed using JPEG2000 algorithm, with tiling option switched 
on; the square tiles of 128 x 128 px in size — segments — are transformed sep- 
arately. 



for computing the wavelet transform in succession can be divided into two main 
classes. 



1.1 Inexact Methods 

This class includes another two types of methods: 

The first-type methods are based on signal windowing and overlapping the 
resultant segments [2], analogous to the short-time Fourier analysis. However, 
windowing introduces (apart from potential considerable numerical errors at 
the window tails) severe problems when the wavelet coefficients arc subject to 
nonlinear processing, e.g. the thresholding step within a denoising algorithm. 

The second-type methods approach the particular signal segments indepen- 
dently — they "blindly" extrapolate samples beyond the boundaries of each 
segment. There exist several such methods, either simpler or more complex [7]. 
This approach, of course, leads to undesirable artifacts on the signal boundaries 
after the processing. 

A typical example of the described inexactness can be seen in Fig. 1. 

1.2 Exact Methods 

This class, which we are most interested in, includes methods which actually 
extend the boundaries by samples from the respective neighbouring segments. 
Most of the contributions in these problems have come from researchers working 
with images and video. They were motivated by the idea of splitting the com- 
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putation into parallel processes run on individual workstations [3, 1]. Within 
this class of algorithms, we can distinguish another two algorithm types: 

The third-type algorithms split the signal into segments which are dis- 
tributed to the particular processes. During the whole computation the pro- 
cesses do not communicate with each other. Afterwards, the results are joined 
together. Such an approach is suitable for systems where interchanging informa- 
tion (i.e. wavelet coefficients) is slow and thus increases the total computation 
time. The disadvantage is that a portion of computation is performed redun- 
dantly, in several processes. 

In the algorithms of the fourth type the processes mutually interchange data 
during computation. Clearly, this is suitable in situations where the commu- 
nication is fast. The principal advantage here is that there is no computation 
redundancy To be more concrete, wavelet coefficients located by the segment 
boundaries computed at each "level" of the transformation are interchanged. 

State-of-the-art algorithms belonging to both the third and the fourth group, 
which can be found in the literature, are derived for the special case when each 
segment length is equal to a power of two. This assumption is their drawback, 
mainly for larger segments (e.g. the difference between 1024 and 2048 can be 
inadmissibly big, considering for example that with images, 1024 2 = 10 6 and 
2048 2 = 4 • 10 6 ). Also, there are situations where the segment sizes are not a 
power of two (e.g. the signal buffer size in audio cards running with ASIO driver 
could be 96 samples). Paper [5] gives directions for the non-power-of-two case. 
However, there is one more thing: all of the algorithms mentioned are made just 
for the purpose of forward DTWT, as they are mainly used for blockwise image 
compression. Moreover, most of the published methods specialize in JPEG2000, 
which means that they are restricted to the biorthogonal wavelet CDF 9/7. 



1.3 Motivation and Goal of SegWT 

The objective for the segmented wavelet transform, denoted SegWT, is naturally 
to allow signal processing by its segments, so that in this manner we get the 
same result (i.e. the same wavelet coefficients) as in the common DTWT case. At 
the same time, SegWT should utilize as much from the DTWT computational 
routines as possible. 

In this paper we present the SegWT method, which can be utilized for any 
wavelet-type segmentwise data processing task, that is to say also in real time. 

SegWT includes both the forward and the inverse parts of the transform. 
The segment length and the wavelet filter can be chosen arbitrarily. In fact, 
SegWT is of the third type in the sense of the above. 

The derivation of the SegWT algorithm requires a very detailed knowledge 
of the behavior of ordinary DTWT, so before we start with SegWT, we recall 
the basic algorithm of DTWT. 
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2 Classical DTWT Algorithm 

Algorithm 1 (Decomposition pyramidal algorithm DTWT) Let x be a discrete 
input signal of length s, the two wavelet decomposition filters of length m are 
defined, highpass g and lowpass h, J is a positive integer denoting the decom- 
position depth. Also, the type of boundary treatment [7, ch. 8] has to be known. 

1. Denote the input signal x by et-°> and set j = 0. 

2. One decomposition step: 

(a) Extending the input vector. Extend st^' from both the left and the 
right sides by (m — 1) samples, according to the type of boundary 
treatment. 

(b) Filtering. Convolve the extended signal with filter g. 

(c) Cropping. Take from the result just its central part, so that the re- 
maining "tails" on both the left and the right sides have the same 
length m — 1 samples. 

(d) Decimation. Downsample the resultant vector. 

Denote the resulting vector by dv + J and store it. Repeat items b)-d), now 
with filter h, denoting and storing the result as a" +1 - ) . 

3. Increase j by one. If it now holds j < J , return to item 2, in the other 
cases the algorithm ends. 

After Algorithm 1 has been finished, we have the wavelet coefficients stored in 
J + 1 vectors (of different length) d J \S J '> ,d (J - 1 '> , . . . , dW. 



3 Method of Segmented Wavelet Transform 

In the problem, the following parameters play a crucial role: m . . . wavelet filter 
length, m > 0, J . . . transform depth, J > 0, s . . . length of the segment, s > 0. 
Based on a detailed knowledge of DTWT, it is possible to deduce fairly 
sophisticated rules how to handle the signal segments. It is clear that it is 
necessary to extend every segment from the left by an exact number of samples 
from the preceding segment, and from the right by another number of samples 
from the subsequent segment (extension, overlap). However, the number of such 
samples depends on m, J and s, and it can be shown that every segment has 
to be extended by a different length from the left and from the right, and these 
lengths can also differ from segment to segment! And, of course, the first and 
the last segments have to be handled in a particular way. 
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Figure 2: Scheme of signal segmentation. The input signal x (a) is divided into 
segments of equal length and the last one can be shorter than this (b). 



3.1 Important Theorems Derived from DTWT Algorithm 

Before we introduce a detailed description of the SegWT algorithm, several 
theorems must be presented. More theorems including proofs can be found in 
[6, ch. 8]. We assume that the input signal x is divided into S > 1 segments 
of equal length s. Single segments will be denoted ^^x, . . . , x. The last 
one can be less long than s and the number S does not have to be known in 
advance. See Fig. 2. The signal boundary treatment considered in this paper 
is "zero-padding" , when the boundaries are extended by zeros (most suitable 
for processing audio recordings, for example), but switching to another type of 
treatment is easy. 

By the formulation that two sets of coefficients from the k-th decomposition 
level follow-up on each other we mean a situation when two consecutive segments 
are properly extended, see Figures 2 and 3, so that applying the DTWT of 
depth k, with step 2a) omitted (cf. Algorithm 3, page 9), separately to both 
the segments, let us say "x and n+1 x, and joining the resultant coefficients 
together leads to the situation that the last coefficient computed from ™x and 
the first coefficient computed from n+1 x would be neighboring in case the signal 
is transformed by the ordinary DTWT. 

Such a situation is desirable and the theorems below lead to proper handling 
of the consecutive segments. 



Theorem 1 In case that the consecutive segments have 

r{k) = (2 fc -l)(m-l) 



(1) 



common input signal samples, the coefficients from the k-th decomposition level 
follow-up on each other. 

Thus, for a decomposition depth equal to J it is necessary to have r(J) = 
(2 J — l)(m— 1) common samples in the two consecutive segments after they have 
been extended. This extension must be divided into the right extension of the 
first segment (of length R) and the left extension of the following segment (of 
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Figure 3: Illustration of extending the segments. 



length L), while r(J) = R+ L. However, the lengths L, R > cannot be chosen 
arbitrarily. In general, the numbers L and R are not uniquely determined and 
must comply with strict rules that will be shown. The formula for the choice 
of extension L max , which is unique and the most appropriate in the case of 
real-time signal processing, is given in Theorem 2. 

For the purpose of the following, we assign the number of the respective 
segment to the variables L max , R m i n , I, so that the left extension of the n-th 
segment will be of length L max (n), the right extension will be of length R m i n (n) 
and the length of the original n-th segment with the left extension joined will 
be denoted l(n). 

Theorem 2 Let the n-th segment be given, whose length including its left ex- 
tension is l(n). The maximum possible left extension of the next segment, 
L max (n+ 1), can be computed by the formula 



imax(n + l) = Z(n)-2 J ceil 



Kn) - r{J) 
2 J 



(2) 



The minimum possible right extension of the given segment is then 

Rmin(n) = r(J) - L max (n + 1). (3) 



Theorem 3 The length of the right extension of the n-th segment, must comply 
with 



R m in(n) = 2 J ceil 



fns 



\2 J 



1,2, 



,5-2. 



(4) 



From (4) it is clear that i? m j n is periodic with respect to s with period 2 J , 
i.e. -R m i rl (n + 2 J ) = i? m j n (n). This property, among other things, can be seen 
in Table I. 
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Theorem 4 (on the total length of segment) After the extension, the n-th seg- 
ment of original length s will be of total length XX n ); which can acquire one of 
two values, either 

r(J)+2 J ceil(£) or r{J) + 2 J ceil (£) - 2 J . (5) 

The overall illustration of SegWT can be seen in Fig. 4. The particular 
algorithms are described in detail in the next sections. 

3.2 Algorithm of Forward SegWT 

The algorithm works such that it reads (receives) individual segments of the 
input signal, makes them extend each other in a proper way, then it computes 
the wavelet coefficients in a modified way and, in the end, it easily joins the 
coefficients. There is no need to know how many segments will be in total, we 
only require that in the moment when the last segment is received, we know 
that information. 

Algorithm 2 Let the wavelet filters g and h be of length m and the decomposi- 
tion depth be J. The boundary treatment mode is "zero-padding". The segments 
of length s > of the input signal x are denoted he, x, x, . . . The last segment 
contains s' < s samples. 

1. Set n = 1, last = 0. 

2. Read the first segment, hi.. Extend it from the left by r(J) zero samples. 
Update Hast'. 

3. If, at the same time, the n-th segment is the last one 

(a) Extend the n-th segment from the right by such a number of zero 
samples that its total length will be L max (n) + s. 

(b) Extend the n-th segment from the right by r(J) zero samples. 

(c) Compute the transform of depth J of the extended segment using 
Algorithm 3. 

(d) Modify the vectors containing the wavelet coefficients by trimming off 
a certain number of redundant coefficients from the left side, specifi- 
cally: on the k-th level, k = 1, 2, . . . , J — 1, trim off r(J — k) coeffi- 
cients. 

(e) Trim off redundant coefficients from the right so that on the k-th level 
floor (2 _fe (L max (S') + s'Yj coefficients remain. 

(f) Trim off the vectors in the same manner as in 3d, but this time from 
the right. 

(g) Store the result as n^J) n^j) n^,]-i) ^ _ _ _ n d (i)_ 

Otherwise 
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(h) Read the (n + l)-th segment, update 'last', 
(i) Compute L ma _ x (n + 1) and R mm (n) (see Theorem 2). 
(j) Extend the n-th segment from the right side: 
If last = 1 (i.e. we have the last segment) 
i. Compute the difference diff = max(0, i? m ; n (n) — s 1 ). 
ii. If diff > (i.e. not enough samples in the last segment for 
extension by R m i n ( n )) 

A. Extend the n-th segment from the right side by s' samples 
from the last segment. 

B. Extend the n-th segment from the right side by another diff 
zero samples. 

Otherwise 

C Extend the n-th segment from the right side by R m in samples 
taken from the last segment. 
Otherwise 

Hi. Extend the n-th segment from the right side by i? m ; n samples 
taken from the last segment. 

(k) Extend the (n+l)-th segment from the left side by L m!ix (n+l) samples 

taken from segment n. 
(1) Compute the DTWT of depth J from the (extended) n-th segment 

using Algorithm 3. 

(m) Modify the particular vectors containing the coefficients in the same 
manner as in 3d. 

(n) Store the result as n d J \ n <$ J \ n <$ J - 1 \ . . . "d^. 

(o) Increase n by 1 and go to item 3. 

Algorithm 3 This sub-algorithm is identical to Algorithm 1 with the exception 
that we omit step 2a), i.e. we do not extend the vector. 

The output of Algorithm 2 is S( J + 1) vectors of wavelet coefficients 

{ i ^ J \ i S J \ i S J - 1 \...M 1) }Lv (6) 

If we simply join these vectors together, we obtain a set of J + 1 vectors, which 
are identical to the wavelet coefficients of signal x. 
The flowchart of Algorithm 2 is in Fig. 5. 

3.3 Corollaries and Limitations of SegWT Algorithm 

In [6] several practical corollaries for SegWT can be found, e.g. that the segments 
cannot be shorter than 2 J . From the description in the above sections it should 
be clear that the delay of Algorithm 2 is one segment (i.e. s samples) plus the 
time needed for the computation of the coefficient from the current segment. It 
can be easily shown that in the special case of s being divisible by 2 J it even 
holds i?min("-) = for every n £ N (see Theorem 3), i.e. the delay of the forward 
method is determined only by the computation time! 
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Double crop redundant coefficients from 
right side 
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Lmax(n+1) samples from 

segment n 



Compute DTWT of segment n using 
algorithm mldw 



Crop redundant coefficients from left side 



Write (output) coefficients 

corresponding with 

segment n 



Figure 5: Flowchart of the forward SegWT, with zero-padding treatment of the 
signal boundaries. The main loop, which is applied to all the segment but the 
first and last ones, is emphasized by the thicker line. 
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Tabic I: Example lengths of extensions for different lengths of segments s. 
The depth of decomposition is J — 3 and the filter length is m = 16. 



s 


n 


12345678 


9 10 11 12 . 




512 


L mas (n) 

-RminO) 


105 105 105 105 105 105 105 105 

00000000 

617 617 617 617 617 617 617 617 


105 105 105 105 . 

0. 

617 617 617 617 . 




513 


L mas (n) 

flminO) 


105 98 99 100 101 102 103 104 

76543210 

625 617 617 617 617 617 617 617 


105 98 99 100 . 

7 6 5 4. 

625 617 617 617 . 




514 


Rmin(n) 
E(«) 


105 99 101 103 105 99 101 103 

64206420 

625 617 617 617 625 617 617 617 


105 99 101 103 . 

6 4 2 0. 
625 617 617 617 . 




515 


Hmln(n) 


105 100 103 98 101 104 99 102 

52741630 

625 617 625 617 617 625 617 617 


105 100 103 98 . 

5 2 7 4. 

625 617 625 617 . 





























3.4 Few Examples 



• For J = 5 and m = 8, the minimum segment length is 32 samples. When 
we set s = 256, R m in will always be zero and L max = r(5) = 217. The 
length of every extended segment will be 256 + 217 = 473 samples. 

• For J — 5 and m = 8 we set s = 300, which is not divisible by 2 5 . Thus 
will alternate with period 8 such that < R m in < 31 and 



The total length of segment after extension will be 



either 505 or 537. 



(Example illustrated in Fig. 4) For J = 3, m = 4, s = 92, the extensions 



will alternate between two states, cither i? n 



4 and L n 



17 or 



Rmin — and £ m ax 

109 or 117 samples. 



21. The length of the extended segments will be 



The increase of the samples entering the computation is naturally a price 
paid for the fact that no errors will originate during processing the boundaries. 



3.5 Algorithm of Inverse SegWT 

The inverse algorithm is described below, in less detail than the forward one. 
Blocks of wavelet coefficients (6) produced segment-by-segment by the forward 
SegWT constitute the input for the inverse algorithm. Analog to the forward 
case, we use the boolean flag last, which becomes true if the very last segment 
has to be processed. 

In addition to that, due to the downsampling step of the forward transform, 
we loss information about the total length of the signal, more precisely we 
do not know if the original length was a or a + 1 for some integer a. We 
could solve this problem by accumulating the lengths of individual inverted 
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segments, however, such a number could be very large, possibly overflowing the 
processor arithmetics. A better solution is just to keep the signal parity (i.e. if 
the accumulated length is is even or odd). The information is then used at the 
very end of the signal for deciding to cut or not to cut the last reconstructed 
sample. 

The inverse SegWT partly utilizes the overlap-add principle for joining the 
reconstructed pieces of the time-domain signal. The length of the overlap stays 
r(J) all the time. As for the illustration, we again refer to Fig. 4. 

Algorithm 4 Let the decomposition depth J be given, as well as wavelet recon- 
struction filters g and h of length m, and coefficients "a( J ) ; n d( J ), n d( J_1 \ . . . , n dW 
for all n. 

1. Set n = 1. Set last = 0. 

2. If last = 1 then the Algorithm ends. 

3. Read the n-th block of coefficients and update 'last'. 

4- Extend the detail coefficients: on the k-th level, k = 1, . . . , J — 1, append 
r(J — k) zero coefficients from the left side. 

5. Compute the inverse transform of depth J using Algorithm 5. 

6. If n =/= I, recall the samples for the overlap, saved in the last cycle, and 
add them to the current inverted block. 

7. Update the parity of the signal. 

8. If last 7^ 1, append the central, non-overlapping part to the output. Save 
the samples of the overlap of the current inverted segment for the next 
cycle. 

Otherwise Append the whole inversion to the output. Eventually crop 
several samples from the end of the signal. 

9. The output (a segment of a time-domain signal) is now complete and pre- 
pared to be "sent". 

10. Increase n by 1 and return to item 2. 

Algorithm 5 This algorithm is identical to the ordinary inverse wavelet trans- 
form (i.e. upsampling -filtering - summing - cropping), but the cropping phase 
is omitted here. 

The flowchart of Algorithm 4 can bee seen in Fig. 6. 



Pavel Rajmic 
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Append whole remaining part of the 
inversion to output 



Eventually crop several samples 
from end of inverted signal 



Append non-overlapping (central) part of 
inversion to output 



Save the overlap of the current 
segment for adding up in next cycle 



Output related to 
segment n 



n=n+1 

— C^ 



Figure 6: Flowchart of the inverse SegWT. The main loop is emphasized by the 
thicker line. 
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3.6 Joining Forward and Inverse Parts to Form Algorithm 
Capable of Real-Time Performance 

The algorithms in Sec. 3.2 and 3.5 were presented separately for clarity. How- 
ever, their easy integration into a simple joint loop forms a universal algorithm 
for any wavelet-type processing task in real time. It can be shown that in the 
case of s being divisible by 2 J the total delay is no bigger than s samples, in 
other cases no bigger than 2s. 
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APPROXIMATING COMMON FIXED POINTS BY AN 
ITERATIVE PROCESS INVOLVING TWO STEPS AND 

THREE MAPPINGS 

SAFEER HUSSAIN KHAN 

Abstract. Agarwal et al [1] introduced an iteration process which 
involves two steps and one mapping. They proved some results us- 
ing nearly uniformly /c-contractions. On the other hand, Berindc 
[2] introduced a new class of quasi-contractive type operators on 
a normed space. First we compare these two types of mappings. 
We then modify both Agarwal's process and mappings of Berindc 
to the case of three mappings keeping the number of steps same. 
We use this modified process first to prove some strong conver- 
gence theorems to approximate common fixed points of three quasi- 
contractive operators in normed spaces and then three nearly uni- 
formly fc-contractions in uniformly convex Banach spaces. This 
will generalize corresponding results of Berinde [2] , Agarwal et al 
[1] and unify a number of results. 



1. Introduction and Preliminaries 

Let C be a nonempty convex subset of a normed space E and T : 
C — *■ C be a mapping. Let {a n } be appropriately chosen sequence 
in (0, 1). Throughout this paper, N will denote the set of all positive 
integers, / the identity mapping on C, F(T) the set of all fixed points of 
T and F = flf =1 F(Tj), the set of common fixed points of the mappings 
Ti:C->C,i = 1,2,3. 

The Mann iterative process [5] is defined by the sequence {x n }: 



, n , Xi — x £ C, 

1 x n+ i = (1 - a n ) x n + a n Tx n , n e N 



where {a n } is in (0, 1). 



2000 Mathematics Subject Classification. 47H10,54H25 . 

Key words and phrases. Quasi-contractive type operator, Nearly uniformly k- 
contraction, Iterative process, Common fixed point, Strong convergence. 
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The sequence {x n } denned by 

{Xi = x G C, 
x n +i = (1 - a?i) x n + a n Ty n , 
y n = (1 - f3 n ) x n + (3 n Tx n , n G N 

where {«„} and {/3 n } are in (0, 1), is known as the Ishikawa iterative 
process [4]. 

Recently, Agarwal et al [1] introduced the following iterative process: 

{X\ = x G C, 
x n+1 = (1 - a n )Tx n + a n Ty n , 
y n = (1 - f3 n ) x n + (3 n Tx n , n G N 

where {«„} and {f3 n } in (0, 1). 

Note that neither (1.1) nor (1.2) can be deduced from (1.3). They 
defined nearly uniformly /c-contraction as a mapping T : C — > C satis- 
fying 

(1.4) \\T n x-T n y\\<k(\\x-y\\+a n ) 

where < k < 1 and {a n } is a sequence in [0, oo) with a n — > 0. Clearly 
every contraction is nearly uniformly /c-contraction. They proved the 
following strong convergence result. 

Theorem 1. Let E be a uniformly convex Banach space and let C be 
its closed and convex subset. Let T : C — > C be a nearly uniformly 
k-contraction with a sequence {a n } and F{T) ^ such that Y^=i a n < 
oo. Define a sequence {x n } in C as: 

x\ = x G C, 

%n+l [,*■ Cin)-'- %n ~r O^n-*- yni 

y n = {l-(3 n )x n + (3 n T n x n , neN 

where {a n },{/3 n } are in (0, 1). Then {x n } converges strongly to a fixed 
point of T . 

On the other hand, Berinde [2] introduced a new class of quasi- 
contractive type operators on a normed space E satisfying 

(1.5) \\Tx - Ty\\ < S \\x - 3/|| + L \\Tx - x\\ 

for any x,y G E,0 < S < 1 and L > 0. This class of mappings is larger 
than not only contractions but also Kannan mappings and Zamfirescu 
operarors. For details, see [2]. 

The following comparison of the defintions (1.4) and (1.5) shows that 
Theroem 1 does not cover the above type of operators. 
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Proposition 1. (1.4) does not imply (1.5) in general. However, if T 
is identity mapping or x is a fixed point of T or L = 0, then we must 
choose a n identically zero. 

Proof. From (1.4) , 

\\T n x - T n y\\ = \\T {T n - l x) - T (T^y) || 

< 5 \\T n - l x - T n - l y\\ + L \\T n x - T n - l x\\ 

< 5(5 \\T n - 2 x - T n - 2 y\\ + L W^^x - T n - 2 x\\) 
+L\\T n x-T n - 1 x\\ 

= 5 2 \\T n - 2 x - T n - 2 y\\ + 5L \\T n ~ l x - T n - 2 x\\ 
+L\\T n x-T n - 1 x\\ 

< 5 3 \\T n - 3 x - T n - 3 y\\ + 5 2 L \\T n - 2 x - T n - 3 x\\ 
+5L llT^x - T n - 2 x\\ + L \\T n x - T n - l x\\ 



5 11 - 1 \\Tx - x\\ + 5 n - 2 \\T 2 x - Tx\\ + ■ ■ 
+5 IIT™-^ - T n - 2 x\\ + \\T n x - T n ~ l x\ 



— " II'*-' ilW ' ■'-' \ i X WTn— 1™ rp n — 2„|| I \\Tn~, T^n—l 



That is, 
(1.6) 

/ pn-l || j> _ || , rn-2 ||j,2 _ rp || , 
1 1 rrm /yin 1 1 ^ r^i 11 1 1 I 7~ I II 1 1 ~ II *^ 1 1 ~ 

11 X y|| -° 11^ 2/ II -+- ^ +< J HT"-^ - T n - 2 rc|| + ||T n x - T"" 1 ^! 

But 

||T 2 x — Tx\\ = \\T (Tx) — Tx\\ 

< 5 \\Tx — x\\ + L \\x — Tx\\ 
= (5 + L) \\x — Tx\\ , 

and 

||T 3 a;-T 2 x|| = \\T (T 2 x) - T (Tx)\\ 

< 5 \\T 2 x — Tx\\ + L \\T 2 x — Tx\\ 
= (5 + L)\\T 2 x-Tx\\ 

< (5 + L) 2 \\x - Tx\\ 

and so on , we get 

\\T n x - T^xll <{5 + L)^ 1 \\x - Tx\\ 
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Tx-x\\ +5 n - 2 \\T 2 x-Tx\\ + 

ra-l 



+8 IIT"-^ - T n - 2 x\\ + \\T n x - T 



8 



X 



"- y + 5 n - 2 (8 + L) + 5 n - 3 (8 + L) 2 + -- 
+5(5 + L) n - 2 + (5 + L) n ' 1 

+(¥r 2 +(¥) 

5 n-i/ 1 + (i + f) + (i + f) 2 + -;- 

' • +(i+f)""+(i+tr 1 



\x — Tx\ 



x — Tx\ 



\x — Tx\ 



n— 1 



8 

T 



(i+ir 



L 



S 



- 1 



|x — Tx\ 



\x — Tx\\ . 



Hence (1.6) becomes 



\T n x-T n y\\ < 8 n \\x-y\\+L 



8 n 

T 



i + 



L 



8 



8 n \\x-y\\+8 r 



= 8 r ' 

< 8 
Choose k = 8. Define 



\\x — y\\ + 
x — y\\ + 



1 + 
1 + 
1 + 



L 
J 

L 

J, 

1 



-1 


\\x — Tx 


1 | \x — Tx\\ 


. \\x — Tx\\ J 


\\x 


-Txij. 



1 + 



8 



\x — Tx\ 



If T is identity mapping or a; is a fixed point of T or L = 0, then a n is 
identically zero. If L ^ 0, then 1 + f G (1, oo) and so (l + 4) diverges. 
Hence a„ -^ as n — > oo. Consequently, any mapping satisfying (1.4) 
does not satisfy (1.5) in general. □ 

Berinde [2] used the Ishikawa iterative process (1.2) to approximate 
fixed points of the class of operators (1.5) in a normed space. Actually, 
his main theorem was the following: 
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Theorem 2. Let C be a nonempty closed convex subset of a normed 
space E. Let T : C — ► C be an operator satisfying (1.5). Let {x n } be 
defined by the iterative process (1.2). If F(T) ^ and X^°=i a n = 
oo, then {x n } converges strongly to a fixed point ofT. 

Now note that although the iterative process used in Theroem 1 is 
better than the one used in Theorem 2 (see [1]) but it does not cover 
the type of operators used in Theorem 2. We, thus, need a modification 
for both of these. Keeping in mind that approximating common fixed 
points has a direct link with the minimization problem, see for example 
[6], we modify both (1.3) and (1.5) to the case of three mappings Ti,T 2 
and T 3 as follows. 

= x e C, 
(1-7) { x n+1 = (1 - a n )Tix„ + a n T 2 y n , 

= (1 - /3J x n + (3 n T 3 x n , neN 

where {a n } and {f3 n } are in (0, 1) and 

(1.8) max ||Tj.x — T^t/II < 5 \\x — y\\ + L max \\TiX — x\\ 

i£{l,2,3} ' «e{l,2,3} 

for any x,y <E E,0 < 5 < 1 and L > 0. 
We note that (1.7) reduces to 

• (1.3) when Ti = T for all % — 1, 2, 3, 

• (1.2) when T x = /, T 2 = T 3 = T, 

• (1.1) when Ti = T 3 = /, T 2 = T. 

We also note that (1.8) reduces to (1.5) in any of the following cases: 

• when T{ = T for alH = 1, 2, 3, 

• when any two of T, i = 1, 2, 3 equal T and the third one is J, 

• when two of T, i = 1, 2, 3 are identity but the third one is T. 

In the rest of the paper, we use the iterative process (1.7) where the 
mappings satisfy (1.8) to prove a common- fixed-point-result in normed 
spaces. A corollary to this result will cover the case of operators defined 
by (1.5) using (1.3). We also obtain generalizations of some results of 
[2]. Moreover, we use a variant of (1.7) to prove a result for nearly 
uniformly ^-contractions in Banach spaces thereby generalizing a result 
of Agarwal et al [1] . 

2. Common fixed points by a two steps three mappings 

PROCESS 

2.1. Results in normed spaces. Our first theorem deals with the 
iterative process (1.7) for the mappings defined in (1.8). 
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Theorem 3. Let C be a nonempty closed convex subset of a normed 
space E. Let T, : C — ► C, i — 1, 2, 3 6e three operators satisfying (1.8) 
and F^0. Let {rc n } fee defined by the iterative process (1.7). if {a„} 
and {/3 n } are sequences in (0, 1) snc/i that Y^Li a nP n = oo, t/ien {x„} 
converges strongly to a point of F. 

Proof. Let w <E F. Then 

||xn+i-HI = WO- - <Xn)Tix n + a n T 2 y n - w\\ 

(2.1) < (1 -a„) \\TiXn- w\\ + a n \\T 2 y n - w\\ . 

Since \\T 2 y n — w\\ < maxj e { 12 ,3} \\Tiy n — w\\ , therefore for x = w and 
V = Vn, (1-8) gives 

(2.2) ||T 2 y„ - iu|| < 5 \\y n - w\\ 
Simillarly, the choice x = w and y = x n provides 

(2.3) \\T 3 x n - w\\ < 5 \\x n - w\\ . 

But 

\\y n -w\\ < (1- f3 n )\\x n -w\\+ f3 n \\T 3 x n -iv\\ 
< (1 - (5 n ) \\x n - w\\ + (5 n 5 \\x n - w\\ 

(2.4) < (l-p n (l-8))\\x n -w\\. 

Then using of (2.1) through (2.4) , we obtain 

||x„+i - iu|| ^ 0- a n)\\Tix n -w\\+a n \\T 2 yn-w\\ 

< (1 - a n )5 \\x n - w\\ + a n 5 (1 - (3 n (l - 5)) \\x n - w\\ 

= [(1 - a n )5 + a n S(l - (3 n (l - 5))] \\x n - w\\ 

= 5[(1 - a n + a n - a n (3 n (l - 5)}\\x n -w\\ 

= S[(l - a n /3 n (l - 8)] \\x n - w\\ 

By induction, 



\x n+1 -w\ 



< J][l-(l-<5)a fc /3 fc ]||xi-HI 
fc=i 

(n 
y — (1 — 5) a k (3 k 
k=i 

(n 
-(1 -5)^2a k (3 k 
k=l 



for all n e N. 
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Since < 5 < 1, a n ,(3 n G (0, 1) and Y^m=i a nP n = oo, we get that 
limsup \\x n — w\ < limsup \\xi — w\\ exp — (1 — 5) y^akf3 k J < 0. 

n-oo n^oo ^ fc=1 J 

Hence linin^oo \\x n — w\\ = 0. Consequently x n — > w G F. This com- 
pletes the proof. D 

Although the following theorem is a corollary to our above theorem 
yet it is new in itself. This theorem also complements and improves 
a result of [1] to the case of the operators defined by (1.5) in normed 
spaces. Actually it is a blend of Theorems 1 and 2 and covers the case 
of operators defined by (1.5) using (1.3). 

Theorem 4. Let C be a nonempty closed convex subset of a normed 
space E. Let T : C — > C be an operator satisfying (1.5) and F{T) ^ 0. 
Let {x n } be defined by the iterative process (1.3). If {a n } and {f3 n } are 
sequences in (0,1) such that Y^=i a nP n = oo , then {x n } converges 
strongly to a fixed point of T. 

Proof. Choose T\ = T2 = T3 = T in Theorem 3. □ 

Note that the following cannot be obtained as a corollary by using 
the iterative process (1.3). However, we can get it by using (1.7). Note 
also that it is Theorem 1 of [2]. Furthermore, the following corollary 
generalizes Theorem 2 of [3] and the results generalized therein. Thus 
our result also unifies a number of results in the literature. 

Corollary 1. Let C be a nonempty closed convex subset of a normed 
space E. Let T : C — > C be an operator satisfying (1.5) and F{T) ^ 0. 
Let {x n } be defined by the iterative process (1.2). If {a n } and {(3 n } 
are sequences in (0, 1) such that Y^=i a nP n = 00, then {x n } converges 
strongly to a fixed point of T. 

Proof. Choose 7\ = T 3 = J, T 2 = T in Theorem 3. □ 

Similarly, we have: 

Corollary 2. Let C be a nonempty closed convex subset of a normed 
space E. Let T : C — > C be an operator satisfying (1.5) and F(T) 7^ 0. 
Let {x n } be defined by the iterative process (1.1). If {a n } is a sequences 
in (0,1) such that Y^Li a n — 00, then {x n } converges strongly to a 
fixed point of T. 

Proof. Choose T\ — I, T2 = T3 = T in Theorem 3. □ 



414 



SAFEER HUSSAIN KHAN 



2.2. Results in uniformly convex Banach spaces. Here we gen- 
eralize Theorem 3.7 of [1] to the case of three nearly uniformly k- 
contractions with a sequence {a n } . We define {x n } in C as: 



(2.5) 



Xi = x E C, 

x n +i = (1 - a n )T?x n + a n T£y n , 

y n = (l- P n ) x n + P n T£x ni n e N 



where {«„} and {j3 n } are in (0, 1). 

Theorem 5. Let C be a nonempty closed convex subset of a normed 
space E. Let 7$ : C — ► C, i = 1,2,3 be three nearly uniformly k- 
contractions with a sequence {a n } and F ^ such that Yl™=i a n < °°- 
Let {x n } be defined by the iterative process (2.5). If {a n } and {f3 n } 
are sequences in (0, 1), then {x n } converges strongly to a common fixed 
point of Tj, % = 1,2, 3. 

Proof. Let w G F. Then 






71+1 



w 



< 
< 



||(1 - a n )T?x n + a n T£y n - w\\ 

(1 - a n ) \\T™x n - w\\ + a n \\T£y n - iy|| 

(1 - a n )k (\\x n - iu|| + a n ) + a n k (\\y n - w\\ + a n ) 



k[(l 



a. 



t ) \\x n - w\\ + a n \\y n - w\\ + a n ] 



< 



< 



(1 - a n ) \\x n - w\\ + a n (1 - (5 n ) \\x n - w\ 
+a n {3 n \\T£x n - iy|| + a n 

(1 - a n ) \\x n - iu|| + a n (1 - (5 n ) \\x n - w\ 
+ka n (3 n \\x n - w\\ + ka n (3 n a n + a n 

(1 - a n + a n (1 - (5 n ) + ka n (3 n \\x n - w\\ 
-\-Kct n p n a n "t a n 

< k [(1 - (1 - k) a n (3 n ) \\x n -w\\ + (k + l)o„] 

< k \\x n — w\\ + k(k + l)a n 

< \\x n - w\\ + k(k + l)a n 

It is well-known that if {r n } and {s n } are sequences of nonnegative 
real numbers such that r n+1 < r n + s n and Y^Li s n < °°5 then limr n 

- iu|| exists. Call it c. If c > 0, then a„. — »■ 



exists. Thus lim ||x r . 
together with ||x n+ i 
contradiction. Hence lim ||a; n - 
a common fixed point of Tj, % 



' n 



'v. 



w 



+ k(k + l)o„ gives c < fee, a 
iu|| = and {x n } converges strongly to 
= 1, 2, 3 as required. □ 
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Remark. In the above theorem, ifT\ is a nearly uniformly k\- contraction 
with a sequence {a* } , T 2 is a nearly uniformly k 2 - contraction with a se- 
quence {a^} andT 3 is a nearly uniformly k 3 - contraction with a sequence 
{a\} , then we can choose a n = min(a^, a^, a^) and k = mm^ikzha) 
so that our result still remains valid. 

Following is the Theorem 3.7 of [1] which we can obtain now by 
choosing Ti = T 2 = T 3 = T in Theorem 5. 

Corollary 3. Let E be a uniformly convex Banach space and let C be 
its closed and convex subset. Let T : C — ► C be a nearly uniformly 
k-contraction with a sequence {a n } and F{T) ^ such that Y^Li a n < 
oo. Define a sequence {x n } in C as in (1.3) where {a n }, {{3 n } are in 
(0, 1). Then {x n } converges strongly to a fixed point ofT. 
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Abstract 

For the study of Z n -periodic bounded measurable functions H which 
are low pass filters in an multiresolution analysis defined on L 2 (R n ) with 
a dilation given by a fixed linear invertible map A : E n — > E n such that 
A(Z") C Z n and all (complex) eigenvalues of A have modulus greater 
than 1, one should assume that the infinite product Il^Li \H((A*)~ : 't)\ 
converges almost everywhere on R n and is A*-locally nonzero at the origin, 
where A* is the adjoint map of A. In this paper we find a condition on the 
regularity of H at the origin which assures that the above requirements 
on the infinite product hold. Moreover, depending of the regularity we 
assume on H we get different necessary and sufficient conditions on H to 
be a low pass filter in an A-MRA following the strategy of Lawton. 

Keywords: Fourier transform, Holder continuous function, Lawton's condi- 
tions, locally nonzero function, low pass filter in a multiresolution analysis. 

1 Introduction and Definitions. 

A multiresolution analysis (MRA) is a general method introduced by Mallat [21] 
and Meyer [22] for constructing wavelets. Afterwards, the concept of MRA was 
considered on L 2 (M. n ), n > 1, (see [20], [11], [26], [27]) in a more general context, 
where instead of the dyadic dilation one considers the dilation given by a fixed 
linear invertible map A : R™ — > R™ such that A(Z n ) C Z" and all (complex) 
eigenvalues of A have modulus greater than 1. Here and further we use the 
same notation for the linear invertible map A and its matrix with respect to the 
canonical base. Given such a linear invertible map A one defines an A-MRA 
as a sequence of closed subspaces Vj, j € Z, of the Hilbert space L 2 (M. n ) that 
satisfies the following conditions: 

(i)VjgZ, VjCV j+1 ; 

(ii) Vj G Z, /(x) G Vj «• I{Ak) G V j+1 ; 
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(iii) U j& Vj = L 2 (R n ); 

(iv) There exists a function <f> £ Vo, that is called scaling function, such that 
{ </>(x — k) : k € Z™ } is an orthonormal basis for V^. 

Properties of scaling functions have been studied by several authors (see 
[21],[15],|9],[4],[11],[1],[8],[14],[19],[5]). ^ 

In this paper, we adopt the convention that the Fourier transform of a func- 
tion / £ L l {R n ) n L 2 (M") is defined by 



/(y) = I /(x)e- 2 - x '^x. 



If is a scaling function of an A-MRA, observe that d^ 1 (f)(A^ 1 x) e V_i C 
Vo, where 4a = |detA|. By the condition (iv) we express this function in terms 
of the orthonormal basis {</>(x — k) : k e Z™} as 

d^V(^ _1 x) = JZ a k <£(x-k), 

keZ" 

where the convergence is in L 2 (R") and {ak}kez™ € ^ 2 - Taking the Fourier 
transform, we obtain 

0(A*t) = H(t)cj>(t) a.e. on R" 
where A* is the adjoint map of A and 

F(t) = J^ a ^ 

kGZ' 



^-27rik-t 



is a Z"-pcriodic function which is called low pass filter associated with the 
scaling function (f>, or shortly low pass filter. We study the problem of when a 
given measurable function H is a low pass filter in an A-MRA assuming some 
regularity on H. 

Before formulating our results let us introduce some notation and definitions. 

Let {ej™ =1 be the natural basis of R n , T™ = R"/Z™ and if we set / G 
L 2 (T n ) we will understand that / is defined on the whole space R™ as a Z" - 
periodic function. With some abuse of the notation we consider also that T™ is 
the unit cube [0, 1)". 

We will denote B r = {x e R" : |x| < r}. For a set E C R™ and a point 
x £ R" we will write x + E = {x + y : for ye E}. The Lebesgue measure of 
a measurable set _E C R n will be denoted by \E\ n and by \e the characteristic 
function of the set E i.e. X-eW takes the value 1 if t £ E and otherwise. 

Given AT £ {1,2,...}, the set of N times diffcrcntiable functions 
/ : R n -► C will be denoted by C N (R n ). 

We will say that a measurable function / : R" —>■ R is Holder continuous 
at x € R™ (cf. [24]) if there exist an open neighborhood of x , U C M. n , and 
constants C, a > such that 

|/(y)-/(x )|<C|y-x r, VyeCA (1) 
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If a=l, / is said to be Lipschitz continuous at Xo- 
In [5] the following definitions were introduced. 

Definition 1. We will say that x e 1™ is a point of A-density for a set E C K n , 

\E\ n > 0, if for any r > 

.. |^n(A-J'B r + x)|„ 

iim — : ; = 1. 

j-oo \A~3B r \ n 

Definition 2. Let / : R n — > C be a measurable function. We say that x 6 R™ 
is a point of ^4-approximate continuity of the function / if there exists E C K", 
|i?| n > 0, such that x is a point of A-dcnsity for the set E and 

lim /(y) = /(x). 
y -> x 

y£E 

Definition 3. A measurable function / : W 1 — > C is said to be A-locally nonzero 
at a point x € R" if for any e, r > there exists j £ N such that 

| { y e A-^'B r + x : /(y) = } |„ < e\A^ B r \ n . 

For a given e L 2 (M™), set 

$^(t)= X! i^ t+k )i 2 - ( 2 ) 

keZ" 

If A : R n —> R n is a linear invcrtiblc map such that A(Z n ) C Z" and all 
(complex) eigenvalues of A have modulus greater than 1, the quotient group 
Z n /A(Z n ) is well defined, then we will denote by A^ C Z n a full collection 
of representatives of the cosets of Z n /A(Z n ). Recall that there are exactly cIa 
cosets (see [11] and [27, p. 109]). 

Let us fix A^» = {PiJi^Q 1 , where po = 0. 

Since A* is a linear invertiblc map such that all (complex) eigenvalues of 
A* have modulus greater than 1 (cf. [27, p. 122], [2]) there exist K > and 
< 13 < 1 such that 

\(A*)-H\<K/P\t\ Vje{l,2,...}. (3) 

Given H <G L°°(T n ) the following continuous linear operator 

P:L l {T n )^L l {T n ): 

d A -i 
Pf(t)= J2 \H((A*)-\t + Pi ))\ 2 f((A*)-\t + Pi )) 

is well defined. This operator was first introduced by M. Bownik [2] as a gen- 
eralization of the analogous operator introduced by W. Lawton [18] for dyadic 
dilations. 
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2 History References. 

A. Cohen [6] gave the first necessary and sufficient conditions for a trigonometric 
polynomial H to be a low pass filter of an MRA on L 2 (M). Those conditions 
were extended for differentiable functions by E. Hernandez and G. Weiss [14] 
and for the Holder continuous functions by R. F. Gundy [12]. Furthermore, 
Cohen's approach was studied by M. Papadakis, H. Sikic and G. Weiss [24] to 
low pass filters that are Holder continuous at the origin. 

At the same time as Cohen's condition appeared, W. Lawton [17] gave an- 
other sufficient condition of a different nature when H is a, trigonometric poly- 
nomial. The necessity of Lawton's condition was settled in 1990 by both A. 
Cohen (see [7]) and W. Lawton [18], independently, (see [9, p. 182-193]). 

For our general case when an MRA is defined on L 2 (M. n ), n > 1, and for 
dilations given by a map A as above described, a generalization of Cohen's con- 
ditions for low pass filters associated with characteristic scaling functions was 
proved by K. Grochcning and W. R. Madych [11] and by W. R. Madych [20]. 
Afterwards, a generalization of Cohen's and Lawton's conditions were obtained 
by M. Bownik [2] where the results were presented with more general assump- 
tions about the regularity of low pass filters. Other necessary and sufficient 
conditions on trigonometric polynomial low pass filters appeared in the paper 
by J. C. Lagarias and Y. Wang [16]. 

The problem of characterization of low pass filters of an MRA was posed in 
the book by E. Hernandez and G. Weiss [14]. 

Characterizations of low pass filters for an MRA on L 2 (U.) and the dyadic 
dilations arc already known, see the papers by M. Papadakis, H. Sikic and G. 
Weiss [24] and by V. Dobric, R. F. Gundy and P. Hitczcnko [10]. Afterwards, 
R. F. Gundy [13] addressed the same question when the condition (iv) in the 
definition of MRA is relaxed by assuming that {4>(x — k) : fc G 2} is a Riesz 
basis for Vq. The author [25] proved another necessary and sufficient condition 
on low pass filters following the strategy of Lawton. In fact, that condition was 
even presented on low pass filters H in an A- MRA defined on L 2 (R n ). Such a 
condition is written below. 

Let H^ be the class of all functions H £ L°°(T n ) such that the infinite prod- 
uct n^Li l^((^*)~"'t)l converges almost everywhere on M" and is A*-locally 
nonzero at the origin. 

Moreover, let n^ be the class of all measurable functions on R" such that 
/(0) = 1, < /(t) < 1 a.e. on K™ and the origin is a point of A*-approximatc 
continuity of /. 
Theorem A. Let H € H^. Then the following conditions are equivalent: 

A) The function \H\ is a low pass filter associated with a scaling function 
6 of an A-MRA where 0(t) := \[f =1 \H{{A*)~h)\. 

B) The only function f G L (T n ) H-Ha invariant under the operator P 
is the function / = 1. 

To give a complete characterization of all low pass filters associated with 
scaling functions, we need also the following remark done in [25] . 
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Remark A. A measurable function H is a low pass filter of an A-MRA if and 
only if \H\ is a low pass filter of some A-MRA. 

Observe that it is not always easy to check whether a function H £ _L°°(T n ) 
belongs to the class H^. In this paper we give a condition on the regularity at the 
origin of the function H which assures that H £ H^. Moreover, depending on 
the regularity we assume on H we prove other necessary and sufficient conditions 
on H to be a low pass filter in an A-MRA following the strategy of Lawton. 
Those conditions do not appear in the literature and are new even for low pass 
filters in an MRA defined on L 2 (M.) with the dyadic dilation. 

3 Main Results 

We prove the following results. 

Lemma 1. Let H £ L°°(T n ) be a function such that \H(0)\ = 1, \H\ is Holder 
continuous at the origin and 

d A -i 
J2 \H(t+(A*)- 1 p l )\ 2 = l a.e. onW 1 , (4) 

i=0 

then H £ Ha- 

In order not to repeat conditions let us introduce the two following classes 
of measurable functions. 

Ta = {/ £ L(T") : /(0) = 1, /is continuous at the origin 

and < /(t) < 1, a.e. on W 1 }. 

If a function / £ Ta is also diffcrcntiablc at the origin, we will say that / 
belongs to the class A^. 

Theorem 1. Let H be a measurable function such that |i?(0)| = 1, \H\ is Z n - 
periodic continuous and Holder continuous at the origin. Then the following 
conditions are equivalent: 

I) The function H is a low pass filter in an A-MRA. 

II) The only function f £ T a invariant under the operator P is the 
function / = 1 . 

Theorem 2. Let H be a measurable function such that \H{0)\ = 1 and \H\ is a 
Z" -periodic differentiate function. Then the following conditions are equivalent: 

1) The function H is a low pass filter in an A-MRA. 

2) The only function f £ Aa invariant under the operator P is the 
function / = 1 . 
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4 Proof of Lemma 1 

Proof of Lemma 1. First of all, we prove that the infinite product 

DC 

H\H((A*)-h)\ (5) 

3=1 

converges almost everywhere on R™ to a well defined measurable function. 

According to the condition (4) there exists a measurable set E C R n , 
\E\ n = 0, such that < \H(t)\ < 1 for every t € R n \ E. Let F = 
^T=-oo(A*) k E. 

Given e > 0, we set two positive numbers r, R > such that U%L (A*)~ k B r C 
B R dU and 

oo 

CK a R a J2 P Ja < e> 

where {/ is the open neighborhood in the definition of Holder continuous at the 
origin and C, a, if and (3 are the corresponding constants in the inequalities (1) 
and (3). Let S = U^ =0 (A*)- k B r . 

lit £ S\F, then for every j e {1, 2, ...} 

< 1 - \H((A*)- j t)\ < CK a [3 ja \t\ a < CK a j3 3a R a . 

Thus, for every J e {2,3, ...} 

.7 



< 1-\{\H((A*)-H)\ 

< i - n i^((^) _j t)i + n i^((^)- j t)ii - ^((a^-h) 

J OO 

< CK a R a J2 P ja < CK a R a J2 P Ja < e- 



3 = 1 3=1 

Letting J — > oo we obtain 

oo 

l-s<Y[\H((A*)- j t)\<l, VteS\F. (6) 

3 = 1 

Furthermore, given t e R n \ F there exits N G {1,2, ...} such that (A*) _Jv t € 
S \ F, then 

AT oo 

[] l^((^)- j t)| [] \H((A*)-i((A*)- N t))\ 

3=1 3=1 

converges. So, we conclude that the infinite product (5) converges a.e. on R" 
to a well defined measurable function. 

Finally, by (6), 9 is A*-locally nonzero at the origin. □ 
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5 Proof of Theorem 1 and Theorem 2. 

In the proof of Theorems 1 and 2 we will need the following results. 

The following characterization of scaling functions in a multircsolution anal- 
ysis was given in [5]. 
Theorem B. Let G L 2 (M. n ). Then the following conditions are equivalent: 

(A) The function (f> is a scaling function of an A-MRA; 

(B) (a) The function 4> is A* -locally nonzero at the origin; 
(P)^ 4> (t) = l a.e. on T"; 

(-f) There exists a Z™ -periodic function, H e L°°(T n ), 
\H(t)\ < I a.e. onR", such that 

4>(A*t) = H(t)<j>{t) a.e. on R n ; 

(C) (a*) Setting \4>(0)\ = 1, the origin is a point of A* -approximate con- 
tinuity of \4>\; and the conditions (j3) and (7) hold. 

Proposition A. Let H <G L°°(Y n ) be a function such that (4) holds. Lf the 
infinite product Y\j=i \H{{A*)~ J t)\ converges almost everywhere then 

a) the function 0(t) := \[f =1 \H{{A*)~h)\ belongs to L 2 (R n ) and 

II 9 ||l 2 (m«) < l; 

b) $ e (t) < 1 a.e. onW 1 ; 

c) $g is a fix point for the operator P, 

where the function 9 is defined by 8(t) :— Ylj=i l-^((^*) J t)|. 

In the above proposition, the condition a) was proved by M. Bownik [2] (cf. 
[9], [14]), the condition b) was proved in the proof of main result in [25] and the 
condition c) also was proved in [2]. 

Remark B. If in Proposition A we add the hypotheses: |-ff(0)| = 1 and |-ff| is 
a Z"-periodic continuous function and also is Holder continuous at the origin, 
then the function 8 is continuous and 8(0) = 1. 

The following result was proved by M. Bownik [2]. 
Proposition B. Assume that a Z™ -periodic function H satisfying (4) and 

\H(0)\ = 1 is of class C N (R n ) for some N = 1,2, Then the function 

0(t) = Il^Li \ H (( A *y 3t )\ ™ also of class C N (R n ) and 0(0) = 1. 

Proof of Theorem 1. First of all, we will prove the implication I) ^=> II). Ac- 
cording to Remark A, since H is a low pass filter in an A-MRA then \H\ is a 
low pass filter in some A-MRA. Thus, because T^ C L 1 (M n ) (~}Ha we finish the 
proof applying the condition B) in Theorem A. 

Let us prove the implication II) ^=> I). According to Lemma 1 the infinite 
product 



6(t) = U\H((A*)- j t) 
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converges almost everywhere on R" to a well defined A*-locally nonzero measur- 
able function. Observe that £ L 2 (M. n ) by the condition a) in Proposition A, 
and in addition, 9(A*t) = \H(t)\0(t) a.e. on R n . 

Let be the function defined by and consider the function <!># given by 
(2), then the condition c) in Proposition A tells us that <!># is a fix point for the 
operator P. 

If we prove that the function <J>e belongs to T^, then by the condition II) 
in Theorem 1 we will have that &$(t) = 1 a.e. on T™. Hence according to 
Theorem B the function 9 is a scaling function of an A-MRA with associated 
low pass filter \H\. 

Obviously, <£># is a ^"-periodic function and < $e(t). We do not write 
"a.e." in the above inequality because according to Remark B, the function 
9 is a continuous. So, by the same reason and due to the condition c) in 
Proposition A, $e(t) < 1 holds. Moreover, since 9 is a continuous function and 
9(0) = 1, the inequalities 0(t) < $e(t) < 1 yield that $e(0) = 1 and the origin 
is a point of continuity of <j>g. Therefore, <&# £ Ta- 

Finally, applying Remark A the proof of Theorem 1 will be finished. □ 

Proof of Theorem 2. In an analogous way that the proof of the implication 
I) =>• II) in Theorem 1 we can prove the implication 1) =4> 2) in Theo- 
rem 2. 

To prove the implication 2) => 1), let 0(t) = fl^Li \ H (( A *)~ j ^)\ and re- 
peating the schema of the proof of II) =^- I) in Theorem 1, it is enough if we 
prove that <&# € A^. From that proof we know that <frg G T^, then it remains 
to prove that &e is differentiable at the origin. 

Let us check that the partial derivatives of <frg at the origin exist and are 
zero. According to Proposition B the function is differentiable and 0(0) = 1. 
Thus using the inequalities 0(t) < <J>e(t) < 1 for every t £ M. n (see the proof of 
Theorem 1) we obtain that &e(0) — 1 and also 

,. i ^W-^O + fee,) ,^, (0(O)) 2 -(0(O + / t e t )) 2 

lim sup | 1 < lim sup — = 0, 

h^o h ' h-*o \h\ 

where the equality is true due to the function (0) 2 is differentiable and it takes 
a maximum value at the origin. Furthermore, 

|$ e (0)-$ fl (t)| , (0(O)) 2 -(0(t)) 2 

lim sup ' y ', , y n < lim sup v v ;; , , y " = 0. 
t^o t " t -o t 



Therefore, <&# is a differentiable function at the origin. 



□ 



References 

[1] C.Boor, R.DeVore, A.Ron; On the construction of multivariate 
(pre)wavelets, Constr. Approx. 9,123-166(1993). 



424 



LAWTON'S CONDITIONS ON FILTERS 



A. San Antolin 



[2] M.Bownik; Tight frames of multidimensional wavelets, Dedicated to the 
memory of Richard J. Duffin. J. Fourier Anal. Appl. 3, no. 5,525-542(1997). 

[3] A.Bruckner; Differentiation of real functions, Lecture Notes in Mathematics, 
659, Springer, Berlin, 1978. 

[4] C.K.Chui; An Introduction to Wavelets, Academic Press, Inc. 1992. 

[5] P.Cifucntes, K.S.Kazarian, A.San Antolin; Characterization of scaling func- 
tions in a multiresolution analysis, Proc. Amer. Math. Soc. 133, No. 4,1013- 
1023(2005). 

[6] A.Cohen; Ondelettes, analyses multircsolutions ct nitres miroirs en quadra- 
ture, Ann. Inst. H. Poincare, Anal, non lineaire 7, no. 5,439-459(1990). 

[7] A.Cohen, I.Daubcchies, J.C.Feauveau; Biorthogonal bases of compactly sup- 
ported wavelets, Comm. Pure Appl. Math. 45, no. 5,485-560(1992). 

[8] S.Dahlke, W.Dahmcn and V.Latour; Smooth rcfinable functions and 
wavelets obtained by convolution products. Appl. Comput. Harmon. Anal. 
2, no. 1,68-84(1995). 

[9] I.Daubcchies; Ten lectures on wavelets, SIAM, Philadelphia, 1992. 

[10] V.Dobric, R.F. Gundy, P.Hitczenko; Characterizations of orthonormal scale 
functions: a probabilistic approach, J. Geom. Anal. 10, no. 3,417-434(2000). 

[11] K.Grochening, W.R.Madych; Multiresolution analysis, Haar bases and self- 
similar fillings of R n , IEEE Trans. Inform. Theory, 38(2),556-568(1992). 

[12] R.F. Gundy; Two remarks concerning wavelets: Cohen's criterion for low- 
pass filters and Meyer's theorem on linear independence The functional and 
harmonic analysis of wavelets and frames (San Antonio, TX, 1999), 249- 
258, Contcmp. Math., 247, Amer. Math. Soc, Providence, RI, 1999. 

[13] R.F. Gundy; Low-pass filters, martingales, and multiresolution analyses, 
Appl. Comput. Harmon. Anal. 9, no. 2, 204-219(2000). 

[14] E.Hernandez and G.Weiss; A first course on Wavelets, CRC Press, Inc. 
1996. 

[15] R.Q.Jia and C.A.Micchelli; Using the refinement equations for the con- 
struction of pre- wavelets. II. Powers of two. Curves and surfaces (Chamonix- 
Mont-Blanc, 1990), 209-246, Academic Press, Boston, MA, 1991. 

[16] J.C.Lagarias, Y.Wang; Orthogonality criteria for compactly supported re- 
finable functions and refinable function vectors, J. Fourier Anal. Appl. 6, 
no. 2,153-170(2000). 

[17] W.M.Lawton; Tight frames of compactly supported affine wavelets, J. 
Math. Phys. 31, no. 8,1898-1901(1990). 



LAWTON'S CONDITIONS ON FILTERS 425 



A. San Antolin 



[18] W.M.Lawton; Necessary and sufficient conditions for constructing or- 
thonormal wavelet bases, J. Math. Phys. 32, no. 1,57-61(1991). 

[19] R.A.Lorentz, W.R.Madych, A.Sahakian; Translation and dilation invari- 
ant subspaces of i 2 (R) and multiresolution analyses, Applied and Compu- 
tational Harmonic Analysis 5, no. 4,375-388(1998). 

[20] W.R.Madych; Sonic elementary properties of multiresolution analyses of 
L 2 (R d ), Wavelets - a tutorial in theory and applications, Ch. Chui ed., Aca- 
demic Prcss,259-294(1992). 

[21] S.Mallat; Multiresolution approximations and wavelet orthonormal bases 
for L 2 (R), Trans, of Amer. Math. Soc, 315,69-87(1989). 

[22] Y.Meyer; Ondelettes et operateurs. I, Hermann, Paris (1990) [ English 
Translation: Wavelets and operators, Cambridge University Press, (1992).] 

[23] I.P.Nathanson; Theory of functions of a real variable, London, vol. I, 1960. 

[24] M.Papadakis, H.Sikic, G.Weiss; The characterization of low pass filters and 
some basic properties of wavelets, scaling functions and related concepts, J. 
Fourier Anal. Appl. 5, no. 5,495-521(1999). 

[25] A.San Antolin; Characterization of low pass filters in a multiresolution 
analysis (to appear in "Studia Mathematical) 

[26] R.Strichartz; Construction of orthonormal wavelets, Wavelets: mathemat- 
ics and applications, Stud. Adv. Math., CRC, Boca Raton, FL, 23-50(1994). 

[27] PWojtaszczyk; A mathematical introduction to wavelets, London Mathe- 
matical Society, Student Texts 37, 1997. 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS, VOL.8, NO.3, 426-438, 2010, COPYRIGHT 2010 EUDOXUS PRESS, LLC 



Strip-saturation Model Solution for Piezoelectric 

Strip ~ by Quadratically Varying Electric 

Displacement 

*R. R. Bhargava and 2 Amit Setia 

Department of Mathematics 

Indian Institute of Technology Roorkee 

Roorkee, 247667, India 

e-mail: ^ajrbfmaCMitr.ernet.in, 2 setiadma@iitr.ernet.in 

Abstract 

A crack arrest model is proposed for a cracked poled infinitely long and 
composite narrow piezoceramic strip. The finite crack is symmetrically 
situated and oriented longitudinally with respect to the edges of the strip. 
Uniform anti-plane shear stress or strains and in-plane normal electrical 
displacement applied on the finite distant edges of the strip. Consequently 
strip yields both mechanically and electrically. Under the assumption that 
the strip is electrically more brittle, an electrical singularity is encoun- 
tered first, ft is at this level the investigations are carried. To stop the 
crack from further electrical polarization the rims of the developed satura- 
tion zones are prescribed quadratically varying, normal, cohesive, in-plane 
saturation limit electrical displacement. The load required to arrest the 
developed saturation zone is assessed. In-plane electrical crack opening 
displacement, electrical crack growth rate's expression are obtained. Case 
study has been presented for BaTi0 3 , PZT-4 and PZT-5H strips. 

Keywords: Piezoelectric strip, strip-saturation model, crack opening 
displacement, crack growth rate, saturation zone 

1 INTRODUCTION 

Gao and group [8, 7] established that local energy release rate for a piezoelectric 
crack with electrical yielding confined to a strip in front of the crack. It is in- 
dependent of the yielding parameters and can be fully determined from a linear 
piezoelectric crack analysis. They further investigated the effects of electrical 
yielding on a finite crack lying perpendicular or parallel to the poling axis of 
an infinite poled piezoelectric ceramic medium. A crack perpendicular to the 
poling axes in a general poled ferroelectric is discussed by Ru [4] for the im- 
plications of the strip-saturation model for a electric field inducing crack. He 
[5] also conducted the studies for mixed boundary value problem and obtained 
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near crack tip field for a conducting crack parallel/perpendicular to the poling 
axis using based on a strip-saturation model. Wang and Zhang [1] discussed an 
electric strip-saturation model for fracture prediction of piezoceramics contain- 
ing electrically impermeable cracks. Wang and Mai [2] investigated the fracture 
behavior of a cracked piezoceramic medium under transient electromechanical 
loads. The work on cracked piezoelectric strip was started by Shindo et al. 
[12]. They used the theory of linear piezoelectricity to solve the electroelastic 
problems of a finite crack in an orthotropic piezoelectric strip. Fourier integral 
transform technique was used to reduce the problem to solve a pair of dual in- 
tegral equations. They [13] extended the work to study the singular stress and 
electric field in an orthotropic piezoelectric ceramic strip containing a Griffith 
crack under longitudinal shear. Li [3] analyzed the problem of a finite crack 
in a functionally graded material strip under an antiplanc mechanical and in- 
plane electrical loading. In this case elastic stiffness, piezoelectric constants, 
and dielectric permittivity were taken to vary along the thickness of the strip. 
Li [9] examined the strip-saturation model for piezoelectric crack in permeable 
environment to analyze fracture toughness of a piezoelectric ceramics. In this 
study a permeable crack was modeled as a vanishing thin but finite rectangular 
slit with surface charge deposited along a crack surface. 

2 METHODOLOGY 

As is well-known out-of-planc displacement problem along xoy— plane may be 
defined as 

u x (x, y, z) = u v (x, y, z) and u z (x, y, z) = u z (x, y) (1) 

where u,i{i = x, y, z) define the displacement components along x, y and z~ directions. 
Similarly an in-plane electric field problem for xoy— plane is defined as 

E x (x,y,z) = E x (x,y),E y (x,y,z) = E y (x,y) and E z (x,y,z) = (2) 

where Ei,(i = x, y, z) denotes the electric field component along z— direction. 

Consequently linear piezoelectric theory the constitutive equations may be 
written as 

Oxz = C 44 U ZiX + ei50,a; (3) 

Oyz = C Ai U Zi y + ei50 lK (4) 

D x = ei 5 u ZjX - eu4> tX (5) 

D y = ei 5 u z ^ y - en0 )3/ (6) 

where cr iz , Di{i = x, y) denote the shear stress component, electric displacement 
component. A comma after function denotes its partial differentiation with 
respect to the argument following it. C44, e±5 and en denote elastic piezoelectric 
and dielectric constants respectively 
The gradient equations reduce to 

liz = u Zii (7) 
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E l = -cf> }i (8) 

where i = x,y. 

Stress equilibrium equation in absence of body forces are given by 

aijj = (9) 

where i, j — x, y, z. Electrical displacement equation in absence of body electric 
charge may be written as 

A,i = o (io) 

The governing equations are obtained substituting Eqs.(3 to 6) into equilibrium 
Eqs. (9, 10), which finally, reduce to the solution of 

V 2 u z = and V 2 = (11) 

where V = -^ + -§-? is the Laplacian operator. 

Using Fourier cosine transform solution of Eq. (11) may be written as 

2 f°° 
u z (x,y) = — / L4i(a) cosh(ay) + A 2 (a) s'mh(ay)] cos(ax)da + a^ y (12) 
Wo 

and electric potential, <j) is given by 

2 f°° 
4>{x, y) = — / [Bi(a) cosh(ay) + B2(ct)smh(ay)]cos(ax)da — b^y (13) 

71" Jo 

where Ai(a), Bi(a) are the arbitrary functions. These are determined using 
boundary conditions of the problem under investigation. And arbitrary con- 
stants ah and bh are obtained using conditions prescribed on the edges of strip. 
Since the boundary condition are also prescribed on permittivity of the vac- 
uum inside the crack, the constitutive equation for electric displacement com- 
ponents DY , (i = x,y) reduce to 

D v x - e E x (14) 

DY = e E y (15) 

where eo is the electrical permittivity of the vacuum and ei5 = e^i — e^ = 0. 
The governing equation for potential for potential <j> v in vacuum reduce to 

W 2 <j> v = (16) 

The solution of which using Fourier transform technique and the condition 
D y (x, 0) = D^(x,0) may be written as 

2 f°° 
<f> (x,y) = — C(a) sinh(ay) cos(ax)da for < x < c (17) 

71" Jo 

Opening mode electric displacement intensity factor, at the tip x — a, is 
defined as 

Kf = lim W2n{x - a)D y (x,0)} (18) 
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In-plane open mode electrical displacement, D y (x) is calculated using 

2 f a 
D v (x) = — / M(x,a)K?(a)da 
ei5 Jx 



where 



taken from ref. [10]. 



M(x,a) 



3 THE PROBLEM 



a 1 



7T y/a 2 — X 2 



da 



(19) 



(20) 
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Figure 1: Schematic representation of the problem 



An infinitely long narrow piezoceramic strip occupies the region —h<y<h 
and — oo < x < oo in xoy— plane as shown in Figure 1. The strip is assumed to 
be uniformly thick along z-direction to allow the anti-plane shear stress/strain 
state. The strip is poled along z— direction. The infinitely distant edges of 
the strip are stress/strain and charge free. The strip is cut along a hairline 
straight crack which occupies the interval y = and — c < x < c and ori- 
ented longitudinal to the edges of the strip. The rims of the crack are stress 
and charge free. The edges x = ±h are prescribed uniform anti-plane shear 
stress T yz (x,±h) — Th or deformation j yz (x,±h) — 7^ together with in-plane 
normal electrical displacement D y (x,±h) = Dh- Under the assumption that 
strip is electrically more brittle hence an electric singularity is encountered first. 
Consequently under small-scale electric polarization condition a strip-saturation 
zone develop ahead of each tip of the crack. Each of the saturation zone occu- 
pies the interval j/ = 0, c < x < a and — a < x < — c, respectively. To arrest 
further polarization the rims of the developed saturation zones are subjected 
to cohesive electrical displacement D y = mx 2 , where under small-scale electric 
saturation m = D s /c 2 ,D s being the saturation limit electrical displacement. 
Consequently the crack is stopped from further opening. 
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4 MATHEMATICAL MODEL 

A poled piezoceramic strip occupies the interval \y\ — h and |x| < oo in 
xoy— plane. The strip is cut along y = 0, |x| < c. Due to the symmetry in 
the problem only first quadrant region is considered. The conditions prescribed 
above may be mathematically get translated as 

(i) On finitely distant edges of the strip y — h, x — > oo 

(a) Case I : a yz (x, h) = 77,, , D y (x, h) = D h 

(b) Case II : J yz (x, h) = j h , D y (x, h) = D h 
(ii) 4>{x, 0) = 0, for c < x < oo 

(iii) E x (x,0) =E^(x,0),ior: < x < c 
(iv) D y (x,0) = mx 2 H(x - c),for < x < a 
(v) u z {x, 0) = 0,for a < x < oo 

5 ANALYSIS AND SOLUTION 

The general solution of the problem is written using Eqs. (12 and 13). Arbitrary 
functions and constants are determined using boundary condition (i to iv) as 

follows. 

5.1 Determination of arbitrary constants an and bh 

Substituting from Eqs. (12 and 13) into Eqs. (4, 6) and using each (i) and 
simplifying one obtains for 
Case I: 

i eii7v, - ei 5 A, 

a h = 9 — (21) 

c 44 eii + e? 5 

bI e 15 r h -c 44 D h 

c 44 eii+e2 5 

where superscript I denotes that the quantity refers to Case I. 

And for Case II analogously using Eqs. (12, 13, 4, 6), boundary condition 
(i)(b) and calculating, following is obtained 

ai 1 - lh (23) 

b u = D h -e lblh (24) 

6ll 

Superscript II denotes that the quantities refer to Case II. 

5.2 Determination of arbitrary functions Ai(a) and Bi(a), (i 
1,2) 

Remaining of the boundary condition (ii to v), using appropriate constitutive, 
gradient and Eqs. (12 and 13) yield a set of integral equations to enable to find 
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Ai(a) and Bi{a) as follows: 

Boundary condition (ii) together with Eq. (13) leads to integral equation 



Bi(a) cos(ax)da — 0; c < x < oo (25) 

'o 

Boundary condition (iii) using Eqs. (13, 17, 8), yields the integral equation 

/■OO 

/ aBi(a) sin(ax)da — 0; < x < c (26) 

Jo 

Solving above pair Eqs. (25, 26) of dual integral and introducing Bi(a) as 

B 1 (a) = 7 ^-j y/£* 2 {£)J (caS)d£ (27) 

It is obtained that ^2(0 = which implies that 

Bi(a) = (28) 

Boundary condition (iv), Eqs. (12, 13, 6, 17, 15) can be simplified to yield 



X 



ei5 / aAi(a) tanh(a/i) cos(ax)da + c?o = rax H{x — c), < x < a (29) 

n Jo 

do = a\ + b\ (30) 

where i = I, II. 

Boundary condition (v) together with Eq. (12) gives 

Ai{a) cos(ax)da — 0; a < x < oo (31) 

o 

Introducing for the convenience of computations 

A 1 (a) = '^-f y/S^ffiJoiaaZW (32) 

and solving the pair of dual integral Eqs. (29, 31), one finally obtains after 
computations a Fredholm integral equation of second kind for determining Wi (£) 
from 

*i(0+ / K(S, V )* 1 (ri)dv = 



-<e<i 

a 
(33) 



JO 

^ ' . K a 



D ^i/2 ma 2 f 5 / 2 / 2 . (c a\ 1 . / . /c/a 

— ^ 1 arcsin -\ — sm 2 arcsm 

2ei 5 V 7T l(/\ \ V £ 



Ai(a) is now determined using Eqs. (33 and 32). 
where kernel K(£, rj) 

/*°° / h \ 

K(^,rj) = ytf] / atanh 1 ) Jo(ar)) J^(a^)da (34) 

Jo V a ) 

Equation (33) in turn is solved numerically using MATHEMATICA. 
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6 APPLICATIONS 

Results obtained are applied to calculate open mode electrical displacement 
intensity factor K®(a), saturation zone length, in-plane electric crack opening 
displacement, crack growth rate of electrical crack opening displacement. 

6.1 Open- mode electrical displacement intensity factor Kf(a) 

Substituting in formula (18) from Eqs. (6, 12, 13, 28, 32, 33 and 34) and 
simplifying one obtains the Kf(x) at x = a. 

Kf{a) = ei6V^o*i(l) (35) 

6.2 Saturation zone length 

Saturation zone is obtained using the hypothesis that electrical polarization 
vanishes at x = a, based on Dugdalc hypothesis. Consequently Eq. (35) yields 
*i(l) = 0. 

Equating ^i(l) using Eq. (33 and 34) gives a transcendental equation to 
determine a from 



••--'OWS- 1 



do - e 15 T(h/a) 



(36) 



where 



T(h/a)= f if (1, 77)^1 (r?)*7 (37) 

Jri=0 

7 CRACK OPENING IN-PLANE ELECTRI- 
CAL DISPLACEMENT 

Crack opening in-plane electrical displacement, D y {x), is obtained superimpos- 
ing the [D y ] D due to electrical displacement, D h , prescribed on the edges of the 
strip and the normal cohesive in-plane electric displacement [D y ] 2 prescribed 
on the rims of the developed saturation zone. 

Using formulae (19, 20) and substituting [lfP(a)] „ = doy/ifa and inte- 
grating, one obtains 

[D v (x)} Dh = — y/a?-x* (38) 

eis 

Similarly [D y (x)] mx2 is obtained using formulae (19, 20) and [KP(a) mx 2^ — 

cos" 1 ^ + ^"V 1 ' 



m 5/2 



and integrating 



\D (x)] 2m '"' 



7rei 5 J a=x y/, 



a* — x^ 



n _-, c 1 . /„ . _i / c 

sin 1 — sm 2 sin — 

2 a 2 V \a 



(39) 
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Consequently the crack opening displacement of crack face is obtained using 

D v (x) = [D v (x)] Dh - [D y (x)] mx2 (40) 

Total crack opening displacement is obtained by 2D y (x). 



8 ELECTRIC DISPLACEMENT GROWTH RATE 

Under the assumption that cyclic condition could be obtained from the pre- 
scribed monotonic condition. These could be obtained by substitution 



D h 



AD h 



.D 



A A, 



D„ 



D„ 



where A in front of a quantity denotes that quantity is under cyclic conditions. 
Dq is the electrical displacement in absence of mechanical load. The accumu- 
lated crack opening displacement is defined as 



E|AD„(a;)| 



5c 



AD y (x)dx, 



(41) 



where 5 c = -^ is the crack growth rate, where N is number of loading cycles 
which is equal to (a — c)/5c for the crack to advance a distance (a — c) for applied 
electric displacement range. 

The fatigue crack propagation [11] is assumed to start when the accumulated 
electric displacement Dt of a certain point equals a critical value, D c and gave 
a crack growth rate criteria 



D T = T,\AD y (x) 
This together with Eqs. (41 and 42) gives 



Dr. 



(42) 



dc _ 2 
dN ~D~, 



/ AD y (x)dx, 



12 12 



1 /a\ 2 
6 Vc 



i/^K'G)} 



7rei 5 £> c 
which to first approximation may be written as 



o lo S , 
6 \a 



dc 

dN 



w{AKy(a)} 



192e 15 7A 2 c 
where 7 = D C D SC , is the effective surface energy for crack growth and 



(43) 



(44) 



AKf = [AD h - 2e 15 T(h/a)} 



(45) 
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9 CASE STUDY 

Results obtained arc applied to investigate the crack arrest for BaTi03 , PZT-4 
and PZT-5H strips. Material constants are taken from [6] and listed in Tabled. 
Table I: Material constants for piezoelectric ceramics. 

Material Constants PZT-4 PZT-5H BaTiO, 



c 44 (10 lu A7m 2 ) 

e 15 (C/m 2 ) 

en (10- w C/Vm) 



2.56 
12.7 
64.6 



2.3 
17 
150.4 



4.3 
11.6 

112 




1 2 3 

Strip width to crack length ratio, h/c 



Figure 2: Variation of applied electrical displacement versus strip width to half- 
crack length ratio 
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Figure 3: Variation of applied electrical displacement versus saturation zone 
length to half-crack length ratio 



Figure 2. depicts the variation of variable load ratio D^/mx 2 for small scale 
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yielding, with respect to strip width to crack length ratio h/c. For narrow strip 
a non-linear parabolic variation is observed for Dh/mx 2 , as expected. But as 
the thickness of the strip is increased for a fixed crack length then more Dh/mx 2 
required to arrest the crack before the requirement settles for a uniform constant 
value. It is also noted as x/c ratio is increased then less load ratio is required 
to arrest the crack growth. 

Variation of D^/mx 2 vis-a-vis saturation to half-crack length, (a — c)/c, is 
plotted in Figure 3. It is seen as the saturation zone is increased, for a fixed 
crack length, more Dh/mx 2 is needed, as expected. It is also to be noted that 
variation remains same for both the cases. 



Case I 
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Strip width to crack length ratio, h/c 



Figure 4: Variation of normalized electric crack growth rate versus strip width 
to half-crack length ratio for Case I 
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Figure 5: Variation of normalized electric crack growth rate versus strip width 
to half-crack length ratio for Case II 



For Case I crack growth variations versus strip width to crack length ratio 
is depicted in Figure 4. The variation first shows a steep parabolic decrease 
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which stabilizes uniformly to a lower value as strip width is increased. The 
crack growth is also independent of ceramic properties. It is seen as the ratio 
value p = e 15 T7j/c44£)o is increased as 0.5(-0.25)-0.25, the crack growth almost 
becomes negligible. 

Same variation for Case II is plotted in Figure 5. The crack growth for 
this case, is almost four times smaller than that for Case I. The variation 
is material properties dependent. PZT-4 ceramic strip has the least crack 
growth rate closely followed by PZT-5H strip. For lower values of the ratio 
q = e 15 e 11 7; l /e 11 £'o = —0.25 the crack growth rates for both the ceramics 
PZT-4 and PZT-5H are negligibly small. Comparatively although BaTiC>3 has 
higher crack growth rate but it also drops down substantial as the ratio value q 
is reduced from 0.5 to -0.25. 
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Figure 6: Variation of normalized electric crack growth rate versus saturation 
zone to half-crack length ratio for Case I 



\ Case II 

\. e i5 F iirh /E n D o =0 - 5 


- ^^. 0.25 |H\ 


BaTi0 3 

PZT-4 

PZT-5H 


-0.25 ^~*ML 

- - j, o \~~~hLj 





0.30 0.35 0.40 0.45 0.50 0.55 0.60 

Saturation zone to crack length ratio, (a-c) / c 



Figure 7: Variation of normalized electric crack growth rate versus saturation 
zone to half-crack length ratio for Case II 
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For Case I electric crack growth rate variation vis-a-vis saturation zone to 
half-crack length, (a — c)/c, is plotted in Figure 6. The crack growth rate is 
independent of ceramic properties and show a continuous decrease as (a — c)/c 
ratio is increased. It is also noted as the value of the ratio p is reduced from 0.5 
to -0.25 the crack growth rate reduces further tending to almost zero. 

Figure 7. depicts the variation of crack growth rate with respect to ratio 
(a — c)/c for Case II. For Case II the crack growth reduces to more than half 
than that for Case I. The crack growth rate is least for PZT-4 strip which almost 
zero as q ratio is reduced from 0.5 to -0.25. Although BaTiC>3 strip has a higher 
crack growth rate but it continuously reduces to almost become zero. As the 
ratio value q is reduced from 0.5 to -0.25 it drops down further. 

10 CONCLUSION 

A generalized strip-saturation model is proposed for a longitudinally cracked 
poled piezoceramic strip. The analytic expressions are obtained for electrical 
crack opening displacement and crack growth rate. Electrical displacement re- 
quired to be prescribed for saturation zone to arrest crack are also obtained in 
closed form expressions. Case study has been presented for BaTiC>3, PZT-4 and 
PZT-5H strips which shows that the model proposed is capable of crack arrest. 
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Abstract 

Let 7Tjv stand for the space of trigonometric polynomials of degree N 
and F : L p [0,2n] —> 7Tjv denote the Fourier projection onto tin- It is 
well-known that, for all 1 < p < oo, F is the projection onto ttn with 
the minimal norm. It is an open problem whether for every 1 < p < oo, 
p 7^ 2, F is a unique minimal projection onto tin- The aim of this paper 
is to prove the unique minimality of F in L p [0, 2n] spaces for p sufficiently 
closed to 1. 



1 Introduction 

Let 7Tjv = span{l,sin(x),cos(x), ...,sin(iVx),cos(iVx)} stand for the space of 
trigonometric polynomials of degree N. The Fourier projection F : L p [0, 27r] — » 
kn , 1 < p < oo is defined by 

i r 2lT N f\ r 27T \ 

F(f)(x) = y / f ^ dt+ J2(- f(V sin ( fct ) dt sin ( fcx ) 

(i) 

N / 1 /-27T 



e! 



. TT 
k=l 



fit) cos(kt) dt I cos(fca;). 



One can see that F(f) — f * Djv, that is 



2 77 



F(f)(x) = ^l f(t)D N (x-t)dt, (2) 

27r Jo 
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where Dn is a Dirichlct kernel 

D N (x) = l + 2cos(a;) + ... + 2cos(iVx). (3) 

Using the Dirichlct kernel we can compute the norm of the Fourier projection 

in Li[0,2tt] 

\\F\\ 1 = ±-["\D N (x)\dx = L N , (4) 

where Ljv are famous Lebesguc's constants. There are two exact formulas for 
Lebesgue's constants, the Fejer formula 

1 v~* 2 k-rr . 

L N = h > — tan( ) (5) 

2N + 1 £*> kn y 2N + l' y ' 



fe=i 



and the Szego formula 



oc 



16 ^ / 1 1 1 \ 1 

LN =V^X + ? > + l + - + 2(2N + l)k-l)Ak^l- (6) 

Additionally, the exact asymptotic behavior of Ljv is known to be Ln ~ -% In N. 
Consider the following isometries I s : L p [0,2n] — > L p [0,27r], s G [0, 27r), 

W)(x) = f(x + s), (7) 

where the addition of x + s is considered to be modulo [0, 27r] . One can easily see 
that I s (itn) = I" jv, and that the Fourier projection is the only projection that 
is commuting with {7 S , s G [0, 2tt)} 

FoI s = I s oF. (8) 

Following this observation, we get the Berman representation for the Fourier 
projection [1], 

F=^j (Ia^oPoI.da, (9) 

where P is any projection L p [0, 27r] — ► npf. 

From that we obtain minimality of Fourier projection in L p [Q, 2tt] spaces. 
Minimality of Fourier projection in C[0,27r] has been proved earlier in 1948 by 
Lozinski in [6]. Whether the Fourier projection is the unique minimal or not 
has proved to be a challenging problem. In 1969, Cheney et al [2], have proved 
that the Fourier projection is a unique minimal in C[0, 27r] and £oo[0, 2ir]. That 
same year, Lambert [4], proved the same for Li[0, 2w]. The case of L p [0, 2ir] for 
1 < p < oo,p 7^ 2 is still open. In our previous paper, [9], we proved that for 
N = 1 the Fourier projection F : L p [0,2ir] — > tt\ is unique minimal. i p [0,27r] 
for 1<p<oo,pt^2 are smooth and uniformly convex spaces. It is worth 
mentioning that for any d > 3 we can find a finite dimensional subspace V , 
dim]/ = d such that the minimal projection from L p [0,27r] onto V is not a 
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unique minimal [11]. For d = 1,2 all minimal projections have to be unique 
[10]. Uniqueness of related Rademacher projections in L p [0, 2ir], 1 < p < oo has 
been proved in [5]. In this paper we prove that Fourier projection is unique 
minimal in L p [0, 2tt] for p sufficiently close to 1. 

Definition 1.1 Let L : X — > Y be a linear operator. Functional g G S(Y*) is 
called a norming functional for L if 

\\goL\\ = \\L\\. (10) 

Point f G S(X) is called a norming point for L if 

\\L(x)\\ = \\L\\. (11) 

A pair (g, /) G S(Y*) x S(X) is called a norming pair for L if 

9(Lf) = \\L\\. (12) 

If P is a projection from X onto a finite-dimensional subspace Y, then (since 
P is a compact operator) it has a norming functional (see Theorem III. 2.1 [7]). 
If X is reflexive, then any functional attains its norm. Therefore, there is a 
norming pair for P. If X is not reflexive then in general, it is not true. For 
example, the Fourier projection does not attain its norm in C[0,27r]. But any 
functional attains its norm in X** , hence we can always find a norming pair for 
P (extending P to P**) in S(X*) x S(X**). 

The properties of norming functionals play important roles in proving the 
uniqueness of a minimal projection. The following theorem is a special case of 
the result proved in [9]. 

Theorem 1.2 (Theorem 3.2 [9]) Let F : L p [0,2n] -> n N be the Fourier pro- 
jection. Assume that the set of norming functionals for F is total on ttn (i.e., 
if v G 7T/v, and f(v) = for any norming functional f £ -L g [0,27r] for F, then 
v = 0.) Then the Fourier projection F is the unique minimal projection. 

The main result of this paper is the following: 

Theorem 1.3 Fix N and consider the Fourier projection F : L p [0,2ir] — » 7Ttv- 
Then there is an e > such that for every p G (1, 1 + e) the Fourier projection 
is the unique minimal projection from L p [0,2ir] onto 7T/\r. 

2 Results 

We begin with some preliminaries. 

Lemma 2.1 (Ruess-Stegall [8]) Let X be a Banach space and V be the finite 
dimensional Banach space. Then 

L(X,V)=L W *(X**,V), (13) 
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where "=" means isometrically isomorphic and L W *(X** , V) stands for all weak* 
continuous operators. Additionally, this isometry is given by 

i:L^L**. (14) 

Since ^~ dm is a probabilistic measure, for every 1 < r < s < oo we have 

Loo[0,27r] C L s [0,2ir] C L r [0,27r] C Li[0,2tt] (15) 

Theorem 2.2 Fix N and consider the Fourier projection F : L p [0,2ir] — ► ttj^. 
Take any p n — ► 1 ararf fe£ g n £ S(L qn [0,2ir]) be any norming functional for F 
in L Pn [0, 2tt] (as usual p n and q n are conjugates to each other — + — = 1). 
Then we can find a subsequence of g n that converges weakly in Li[0,27r] to some 
function g, \g\ = 1 a.e. such that g is a norming functional for F in Li[0,2t:]. 

Proof. Take /„ <G S(L Pn [0,2n]) such that (g n ,fn) is a norming pair for F in 

L Pn [0,2ir], that is 

g n (Ff n ) = \\F\\ Pn . (16) 

Since \\F \\ Pn — > ||-F||ii we a ls° have 

lin/n)lk-|l^l|l- (17) 

Observe that F{f n ) e tt n C £i[0,2tt], and ||F(/„)||i < \\F{f n )\\ Pn = H^IL- 
Therefore F(f n ) is bounded in ttn with L\ norm. Since ttn is finite dimensional 
F(f n ) is bounded in any other norm. As a result going into subsequence if 
necessary we can assume that 

F(fn) ^W£lT N . (18) 

Observe that w ^ (otherwise F(f n ) — ► in L^ norm, and that for a big 
enough n would imply ||-F|| P „ = ||-F 1 (/n)||p„ < 1/2)- Therefore w has only a 
finite number of zeroes. Using (16) for n big enough we have 

u (t) | = JTOi^ (19) 

for all t except possibly finite many. Since p n /q n — p n — 1 — > and F(f n )(t) — » 
w(t) (see (18)) using the continuity of the function /i(x, y) = a; 2 ' at points (a, 0) 
where a > we get |( J F 1 /„)(t)| p " /9 " -» 1 and (||i ?, ||p n ) p ' l/9n -» 1. Therefore 

l5n(*)| - 1, (20) 

for all £ except possibly finite many. As a result 

fln(t) -»<?(*), (21) 

almost everywhere, |<?(£)| = 1 and therefore g £ S'(L oo [0, 2ir]). Observe that 
both F(f n ) and w are continuous functions. Using (18) there is a constant M 



SHEKHTMAN-SKRZYPEK: ...FOURIER 443 

PROJECTION... 



such that \F(f n )(t)\ < M. By (19) there is a constant K such that \g n {t)\ < K 
for all t except finite many points. That means that g n G £00 [0, 2tt] and by 
Lebesgue's dominated convergence theorem: 

g n (t)h(t) dm ->/ g(t)h(t)dm, (22) 

Jo 

for every h G Li[0,27r]. 

Now we need to show that this g is a norming functional for F in £i[0, 2ir]. 
Consider F** : L\* — > 7Tjv- Observe that 

||/n||l<||/n|k = l. (23) 

Therefore /„ G Li[0, 27r] and using canonical embedding of X in X** we have 
/„ G L"[0,27r] and ||/ n ||L** < 1. As a result /„ has a weak* convergent subse- 
quence in L**. Passing to subsequence, if necessary, we may assume that 

/„ -^ /, (24) 

weak*inif*[0,27r]and||/||i« < 1. By Lemma 2.1, since F**(f n ),F**(f) G tt n , 
we have 

F**(f n ) ^ F**(f), (25) 

in norm topology. By (22) we have 

g„(h)^g(h), (26) 

for every h G ttn- Putting the above two facts together we get 



\g n (F**f n ) - g(F**f)\ = \g n (F**f n - F**f) + (g n - g)(F**f)\ 

< \\F**f n - F**f\\ + \{g n - g){F**f)\ - 0. 



(27) 



But g n (F**f n ) = g n {Ff n ) = \\F\\ Pn -» ||F||i = \\F\\ Lr . Therefore 

g(F**f) = \\F\\ L ». (28) 

And since ||/||l** < 1 we have \\g o F**\\ = ||F||i*« and as a result 

1100^11 = ||F||i, (29) 

so g is a norming functional for F in £i[0, 2tt]. ■ 

Lemma 2.3 Every norming functional g G S^Lo^O, 2ir]) for the Fourier pro- 
jection F in Li[0,27r] has to be of the form 

g(t) = signD N (t + s), (30) 

for some s G [0, 2tt] (here the addition is considered to be modulo [0, 2ir]). 



444 



SHEKHTMAN-SKRZYPEK: ...FOURIER 
PROJECTION... 



Proof. Let g £ S'(i (x) [0, 2ir)) be a norming functional for Fourier projection 
F : L\[0,2n] — ► ttn- Denote dt = 2^dm(t) and dx — -^—drnix). For any t we 
have 



2 77 



q{x)Dn{x — t) dx 



2lT 

< I \g(x)D N (x-t)\dx 
o 

•2tt i-2-k 

\D N (x-t)\dx= / \D N (x)\dx. 
'o Jo 

Since Dn(x) = 1 + 2cos(x) + ... + 2cos(Nx), the function 

2tt 



M*) = / g{x)Dj^{x — t)dx 



is continuous on [0, 27r]. We will show that 



max 

tG[0,27r] 



g(x)Dpf(x — t) dx 



\Dn(x)\ dx. 



Assume for the contrary that there is 5 > 

/■2-7T 

max / q(x)Dm{x — t) dx = 

te[0.2n] in 



2tt 



\D N (x)\dx) -S 



Take any / £ S(Li[0, 2ir)). Using Fubini's Theorem we would get 

\g(Ff)\- 



r-2ir ,-2n 

(/ f(t)g(x)D N (x-t)dt)dx 
'0 Jo 

(/ f(t)g(x)D N (x-t)dx)dt 
'o Jo 

f(t)(l g(x)D N (x-t)dx)dt 
io 

r2it 

< / l/WI 

JO 
r 2ir / / ,2tt 

^ / Ifmi I I / \D N (x)\dx) -5) dt 



o 



2 77 



g(x)Dpf(x — t) dx 



o 



r// 



(31) 



(32) 



(33) 



(34) 



(35) 



f(t)\ 

\ \J0 

^\D N {x)\dx^-6yj*\f(t)\dt) 

2vr \ 

\D N (x)\dx) -S = ||F||i -<J. 



That implies that 5 is not a norming functional for F, a contrary. Therefore 
(33) holds and, as a result of (31), there is io such that 



g(x)D N (x-t )dx 



and as a result g(x) — sign D^{x — to)- 



\g(x)D N (x-t Q )\dx, 



(36) 
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Theorem 2.4 Fix N and consider the Fourier projection F : L p [0,2ir] — ► ttn. 
Then there is an e > such that for every p € (1, 1 + e) we can find a norming 
functional h € S(L g [0, 27r]) for F in L p [0,2ir] such that 

/•27T /"27T 

(/ h(t) sin(kt) dt) 2 + ( h{t) cos{kt) dt) 2 ^ 0, /or fc = 0,1,..., N. (37) 



Proof. Assume for the contrary that there is a sequence p„ — ► 1 and a sequence 
of norming functionals g n € S(X gri [0, 27r]) for F 1 in L Pii [0,27r] such that 

/ g n (t) sin(kt) dt = and / g n (t)cos(kt) dt = 0, (38) 

Jo Jo 

for some fc € {0, 1, ...,7V}. Using Theorem 2.2 we get 

/•27T /•2'Tr 

/ g(i) sin(fci) rfi - and / gi(t) cos(fci) dt = (39) 

Jo Jo 

and <? is a norming functional for F in Li[0,27r]. But Lemma 2.3 gives g(t) — 
sign D n {x+s). Using Fejer formula [3], we know the first 2iV+l terms in Fourier 
expansion of sign D n (t) : 

1 v~^ ^ m7r 

Sl gnD n (t) = WT - i + ^ — tan ^^ cos(mt) + ... (40) 

m— 1 

From the above we can easily see now that (39) cannot occur. ■ 

Theorem 2.5 Fix N and consider the Fourier projection F : L p [0,2ir] — ► tt^. 
Then there is an e > such that for every p £ (1, 1 + e) i/ie Fourier projection 
is the unique minimal projection from L p [0,2ir] onto ttn- 

Proof. Take e and functional fr from Theorem 2.4. Observe that, by (8), if h(i) 
is a norming functional then /i s (t) = h(t + s) is also a norming functional. We 
will show that the set of norming functionals {h s ,s e [0, 27r]} is total over 7Tat. 
That is if u>(x) G 7Tjv and for every s £ [0, 2-7r] : 

i r2ir 



then w = 0. Take 



, h(t + s)w(t) dt = 0, (41) 

2tt Jo 



1 /" 27r 
a = — h(t) dt 

27r Jo 

i /.2ir 



7T 



a k = - h(t) sin(fci) dt (42) 



o 



1 /-27T 



7T 



fefe = — / /i(£) cos(kt) dt. 



o 
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By Theorem 2.4 ao # and a k , b k ^ for k = 1, ..., iV. It is easy to see that 
h(t + s) dt = a 



i r-2-K 



i /-27T 



2Wo 

/i(i + s) sin(fct) dt — ak cos(fcs) — bk sin(fcs) (43) 

o 

"I f 27r 

h(t + s) cos(kt) dt — ak sin(fcs) + bk cos(fcs). 

Take w = c + J2k=i ( c *= sm (^^) + dk cos(kt)). By (43), the equation (41) would 
imply 

JV 

a o c o + /_^ [ck{o-k cos(fcs) — bk sin(fcs)) + dk{a,k sin(fcs) + bk cos(fcs))] = 0, (44) 
fc=i 

for every s £ [0, 2tt]. As a result 

N 

aoco + X! [(^fe a fc _ c kbk) sin(fcs) + (c k a k + d k b k ) cos(fcs)] = 0, (45) 
k=i 

for every s € [0, 2n]. That is 

a c = 

d k a k - c k b k = (46) 

c fe a fe + d k bk = 0. 

Since ao, a k , bk ^ then c = c^, = dk = for all k = 1, .., iV, as a result io = 0. 
Applying Theorem 1.2 we get the uniqueness of F. ■ 
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ABSTRACT 

We consider the problem of object registration based on a set 
of known template images. The proposed solution employs a 
weighted graph representation of images, and a method that 
reduces the high dimensional problem of evaluating the orbit 
created by applying the set of all transformations in the group 
to a template, into a set of linear equations. The method yields 
a very large number of independent linear constraints that en- 
able an explicit parametric estimation. Mathematical proof of 
the method is presented as well as analyzes and experiments 
that demonstrate its robustness. 

Index Terms — Image registration, Image recognition, Pa- 
rameter estimation, Nonlinear estimation, Graph representa- 
tion of images, Multidimensional signal processing 

1. INTRODUCTION 

This paper is concerned with the general problem of auto- 
matic image registration based on a set of known templates. 
More specifically the paper presents the notion of compactly 
storing the topology of the image in a graph representation, 
and based on this representation, proposes an algorithmic so- 
lution to the registration problem. The fundamental setting 
of the problem and common approaches are provided in [1]- 
[4]. There are two key elements in a deformable template 
representation: A typical element (the template); and a fam- 
ily of transformations and deformations which when applied 
to the typical element produces other elements. The family of 
deformations considered in this paper is extremely wide: we 
consider differentiable homeomorphisms having a continuous 
and differentiable inverse, where the derivative of the inverse 
is also continuous. 

Thus each template is associated with its orbit, induced by 
the group action on the template. Hence, given measurements 
of an observed object (for example, in the form of an image) 
registration becomes the procedure of finding the group ele- 
ment that minimizes some metric with respect to the obser- 
vation. Theoretically, in the absence of noise, the solution to 
the registration problem is obtained by applying each of the 
deformations in the group to the template, followed by com- 
paring the result to the observed realization. However, as the 



number of such possible deformations is infinite, this direct 
approach is computationally prohibitive. Hence, more sophis- 
ticated methods are essential. The analysis and the algorith- 
mic solution derived in this paper enable a rigorous treatment 
of the homeomorphism estimation problem in a wide range of 
applications. 

The center of the proposed solution is a method that re- 
duces the high dimensional problem of evaluating the orbit 
created by applying the set of all possible homeomorphic trans- 
formations in the group to the template, into a problem of an- 
alyzing a function in a low dimensional Euclidian space. In 
general, an explicit modeling of the homeomorphisms group 
is impossible. We therefore choose to solve this problem by 
focusing on subsets of the homeomorphisms group which are 
also subsets of vector spaces. This may be regarded as an ap- 
proximation the homeomorphism using polynomials, based 
on the denseness of the polynomials in the space of continu- 
ous functions with compact support. In this setting, the prob- 
lem of estimating the parametric model of the deformation is 
solved by a linear system of equations in the low dimensional 
Euclidian space. 

More specifically, consider the problem given by 
h(x!,x 2 ) = f(4>(x 1 ,x 2 )) where 4>(x 1} x 2 ) = 
(<pi(xi,X2),<f>2(%i,X2))- In the problem setting considered 
here h and / are given while <p should be estimated. In 
[5], [6], [7] we analyzed this problem and derived estimation 
algorithms of the deformation <f>, using non linear function- 
al employed to construct linear constraints on the parame- 
ters of the homeomorphisms. This paper presents a discrete 
graph representation of images and proposes a generalization 
of these functionals. The new functionals employ the graph 
representation of the image. The concept of graph representa- 
tion of images stems from digital topology [9], and has been 
employed in a verity of applications [10], [11], [13], [14]. The 
common concept behind these representations is to have the 
topology of the image compactly packed in a graph structure. 

2. GRAPH REPRESENTATION OF AN IMAGE 

Observe a continuous 2D image of an object and allow it to 
deform elastically into any shape as long as the deformation 
is a homeomorphism, (i.e. a continuous and invertible trans- 
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formation such that its inverse is also continuous). See Figure 
1. 





Fig. 1. Homeomorphisms on an image. 



Obviously, the key the to the solution of the any registra- 
tion problem is the understanding of what are the basic invari- 
ant properties of the deformed object. Clearly, its size and 
contours are not; the edges, and the straight lines in it also 
vary due to the elastic deformation. The set of colors of the 
deformed image may also change due to varying illumination 
conditions. Yet, there is a fundamental property that remains 
invariant and this is the object topology. 

It is known that homeomorphisms preserve connectivity 
and the number of holes in a set. (e.g., Figures 2 , 3) 




Fig. 2. Homeomorphisms preserve the number of holes in a 
set, not their shape. 



S*> *t t%-t# 



Fig. 3. The topology of any two sets is maintained, after 
homeomorphisms . 



Now, let us treat an image as a collection of colored patches. 
Each patch is a connected set in R 2 adjacent to different patches. 
Hence, the neighboring patches of any patch remain its neigh- 
bors following any homeomorphism. (See, e.g., Figure 4) 



Fig. 4. The topology of a complex structure of sets is invariant 
to homeomorphisms. 

Denoting the set of all bounded and measurable functions 
with compact support from X c R 2 into Y C R by M, the 
previous understanding leads us to define the following trans- 
formation from M into the set of weighted graphs: f(x, y) — ► 
< E, V > by the following rule: 



uev^uc f- l (i) 

< vi,v 2 >e E ^=^ cl(vi) n cl(v 2 ) ^ 



(1) 



where i £ Im{f} is the weight of the vertex v ,v C X is 
a connected set, and cl means the closure of a set. The rep- 
resentation is non bijective: An image is represented by one 
graph only, however, a single graph represents many images. 
The motivation to represent an image by a weighted graph 
is that graph representation is homeomorphism-invariant. (See, 
e.g. Figure 4, 5). The main disadvantage of this represen- 
tation is that apart from geometric deformation, real images 
suffer from intensity variations that may change their graph 
representation. 

3. PROBLEM DESCRIPTION 

In this section we shall briefly set the mathematical frame- 
work we adopt in order to formalize the analysis of the defor- 
mation estimation problem. This framework enables accurate 
representation and analysis of our problem, leading to rigor- 
ous criteria on the existence and uniqueness of the solution, 
and under some mild restrictions to be explained below to the 
derivation of an explicit solution. We note that due to the 
inherent physical properties of the problem, it is natural to 
model and solve it in the continuous domain. Inherently, the 
mapping <j> of R 2 into itself is of a continuous nature, as is 
the physical phenomenon of geometric deformation of real- 
life objects it represents. Thus, if we impose a discrete model 
(e.g., (xi, x 2 ) G Z 2 ), we find that, in general, the natural <fi to 
consider is incompatible (as for "almost all" (xi,x 2 ) <G Z 2 , 
(4>i(xi,x 2 ),(j) 2 (xi,x 2 )) ^ Z 2 ). Thus, the problem and its 
solution are formulated in the continuous domain, while the 
sampling and quantization effects that accompany the digital 
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Fig. 5. Several instances of the same object that are distinct 
from each other by homeomorphisms, however they are all 
represented by the same graph. 



of functions that don't belong to Mq, where G is the affine 
group, include any constant function defined on all of R 2 ; any 
periodic function defined on all of R 2 ; and functions with ra- 
dial symmetry, such as a circle (as SC>2(-R) C GL2(i?)). Note 
however that functions with compact support are not transla- 
tion nor scale invariant. 

3.2. Problem Statement 

In the following we assume that G is the group of differen- 
tiable homeomorphisms such that each element of G has a 
continuous and differentiable inverse, where the derivative of 
the inverse is also continuous. The group G lies in the norm 
space C(X) of continuous real-valued functions of X. By the 
above assumption every (j)^ 1 , {<t>~ 1 )' G C(X). Since C(X) 
is a normed separable space, there exists a countable set of 
basis functions {e^} C C(X), such that for every cj> G G, 



-yc*) = Ew*) • 



(2) 



implementation of the method, are handled as noise contribu- 
tions. 

3.1. Group Theory Setting 

Let M denote the space of compact support, bounded, and 
Lebesgue measurable (or more simply, integrable) functions 
from R 2 to R. Let x be some vector in R 2 . 

Let G be a group representing the set of deformations the 
objets may undergo. G is said to act as a transformation 
group on M if there is a mapping G x M — > M, denoted 

by O, /) i-» / ° 4> = /(0( x )) such that (/ ° <M ° fa = 

fo(4>iO(j)2) for every (pi,<j>2 G Gand / G M; and if foe = f 
for all / <E M, where e is the identity element of G. 

For a given / G M, the set {/ o : <f> g G} is called the 
orbit of /. It is the entire set of possible observations on the 
object - the result of applying to it any of the deformations in 
the group. 

The stabilizer of the function f € M with respect to the 
group G is the set of group elements <p E G such that / o <fi = 
f, i.e., the set of group elements that map / to itself. 

Thus the group G naturally defines an equivalence relation 
on M in terms of the orbits of M induced by the action of G: 
Any two functions h and / are equivalent if they are on the 
same orbit, i.e., if there exists some <fi E G such that fo<p = h. 

Let Mq C M be the subset of functions in M with no 
group symmetry, i.e., the set of functions in M whose stabi- 
lizer is trivial and includes only e, the identity element of G. 
Thus, Mq is the subset of functions in M where uniqueness of 
the solution to the defined problem is guaranteed in the sense 
that if h, f G Mq such that they are on the same orbit, then 
there exists a single <p such that / o (p = h. In [8] we show 
that Mq is dense in M in the L 2 norm. In contrast, examples 



In other words, it is assumed that every element in the 
group and its derivative can be represented as a convergent 
series of basis functions of the separable space C(X). Our 
goal then, is to obtain the expansion of </> _1 (x) with respect 
to the basis functions {ei(x)}. In practice, the series (2) is 
replaced by a finite sum, i.e., we have 1 < i < m. 

Given two bounded, Lebesgue measurable functions 
h,g G Mq with compact supports, such that 



Mx) - /(0(x)) , cf, e G, x g R 2 



(3) 



the problem is to find the deformation <f>. As indicated above, 
the direct approach for solving the problem of finding the pa- 
rameters of the unknown transformation <j> G G is to apply 
the set of all possible transformations, (i.e., every element of 
G), to the given template /, thus evaluating the entire orbit 
of /. Since h and / are homeomorphic, one of the points on 
the orbit represents the action of the desired group element <f>. 
Nevertheless, since (f> is modeled by m parameters, it is clear 
that implementation of such a search on the orbit requires a 
search over an m-dimensional manifold embedded in an infi- 
nite dimensional function space, which is infeasible. 

In this paper we show that the problem of finding the pa- 
rameters of the unknown elastic transformation, whose direct 
solution requires a highly complex search in a function space, 
can be formulated as an explicit parameter estimation prob- 
lem. Moreover, it is shown that the original problem can be 
formulated in terms of an equivalent problem which is ex- 
pressed in the form of a linear system of equations. From 
every subgraph of the graph that represents the template we 
obtain a linear constraint in the unknown parameters of the 
transformation. A solution of this linear system of equations 
provides the unknown transformation parameters. In Section 
4 we show how the problem of finding the parametric model 
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of the deformation can be transformed using a set on non- 
linear graph-based functionals into a set of linear equations 
which is then solved for the transformation parameters. How- 
ever, before getting in Section 4 into the details of this new 
representation of the problem, we shall briefly elaborate on 
the mathematical construction that enables it. 



3.3. The Mathematical Structure and the Fundamental 
Commutative Property 

Recall that Mq is the space of compact support, bounded, 
measurable functions, with no group symmetry. Let L^ be the 
mapping from Mq to itself induced by the group G, such that 
L ( p(f(x)) — f((f)(x)) for every / € Mq and every <fi € G. 
Since L 4> (af 1 (x) + bf 2 (x)) = aL ct> (f 1 (x)) + bL (f ,(f2(x)),we 
have that L^ is a linear operator. Thus, the problem we ad- 
dress can be restated as follows: Given the pair L^ (f(x)), f(x) 
find the linear operator L$. 

Towards this goal, let us define an operator w such that: 



4. LINEAR CONSTRAINTS FROM DEFORMED 
IMAGE 

To simplify the notation and the accompanying discussion we 
present the solution for the case where the observed signals 
are one-dimensional. The derivation for higher dimensions 
follows along similar lines. Consider the problem formulated 
in (3) and let z — <j>(x). Then (p~ 1 (z) = x, and hence 

{4T 1 )'{z)dz = dx (6) 

Let us choose any p elements {g p } p= i C G* , and as 
we show next, these elements are employed to translate the 
identity relation (3) into a set of P equations: 



w(h(x), g p )dx 



w {f{<t>{x)),9p)dx 
( ( p- 1 (z))'w(f(z),g p )dz 



w: M G xG 



Mr 



(4) 



where G* is the set of weighted graphs in which the weight 
of each vertex consists of two elements of Y: 



G* 



{<V,E,i;>,i;:V^Y 2 ;TP=(iP 1 ^ 2 )} (5) 



The operator w acts on an image / e Mq and a graph 
g p E G* by finding all of the graphs g p as subgraphs of the 
graph that represents the image /. To each detected vertex 
v with an intensity ip (v) the operator assigns a new value 
tp 2 (v) and eliminates the graph vertices that weren't found. 
Thus a new image is formed with non zeros values only at 
locations that satisfy the definition of the subgraph g p . This 



mapping, denoted by M, 



'h- 



M 



M, is defined as follows: 



Mg p {f(x)) — w(f(x),g p ), g p £ G*. The fundamental prop- 
erty being exploited in this paper in order to reduce the origi- 
nal high dimensional problem to an equivalent problem that is 
linear in the unknown transformation parameters is the com- 
mutative property of the left composition operator w and the 
right composition operator <j>, stated explicitly in the next the- 
orem: 

Theorem 1. Let f, h e M G and g p e G*. Then L ( p(f(x)) = 
h(x) implies that L < p(w(f(x), g. p )) — w(h(x), g p ). In a more 
concise form: M Sv {L^) = L^{M gp ) 

Proof. [M flp (L )](/(x)) = M gp {f{(j>{x)) = w(f(<f>(x)),g p ) 
= L4w(f(x),g p )) = [L^(M gp )](f(x)) □ 



Thus, knowing how L^ acts on some function /, we know 
the action of L^ on any function w(f(x),g p ), for any g p £ 
G*. 



E & * 



P =i,. 

Rewriting (7) in a matrix form we have 
fw{h,gi) 



ei(x)w(f(x),g p )dx 

P (7) 



Jw(h 7 g p ) 
feiw{f,9i 



Jeiw(f,g p ) 



fe m w{f,gi) 

J e m w(f,g p ) 



6i 



(8) 



Based on the fact that the operator w is homeomorphism in- 
variant, we have the following theorem: 

Theorem: The homeomorphism <p satisfying the paramet- 
ric model defined in (2) is uniquely determined iff the matrix 



/ei w(/,5i) 
. Jeiw(f,g p ) 



Je m w(f,g!) 
Je m w(f,g p ) 



(9) 



is full rank. 

Thus, provided that {g p } p= i are chosen such that (9) is 
full rank, the system (8) (in the absence of noise we take P — 
m) can be solved for the parameter vector [b\ , . . . , b m ] . It is 
clear that in the absence of noise, any set of weighted graphs 
{g p } p n =1 such that (9) is full rank is equally optimal. 

5. NUMERICAL EXAMPLES 

The following example illustrates the proposed solution for 
elastic image registration. The template was taken to be an 
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RGB image of dimensions 314 x 314, and the observation is 
an elastic deformed version of it (Figure (6)) where the defor- 
mation function is: 
<j>(x, y) = (-0.75a; - 1.29y - 1.35x 2 , 1.29s - 0.75y) 







| # fi ^ i 



Fig. 8. The operator application on the template and observa- 
tion based on all sub-graphs with two vertices. 



Fig. 6. Template and elastically deformed observation. 



The estimation does not involve any search scheme, but 
only the application of the graph representation of each im- 
age and our linear model. Figure 7 depicts the operation of 
functionals that are based on all sub-graphs with one vertex 
only. In this example each vertex corresponds to a subset of 
the image color space that consist of 100 3 colors out of the 
entire RGB color space of 256 3 colors. The label assigned 
to each vertex is an indicator function of the corresponding 
color cube. Figure 8 depicts the operation of functionals that 
are based on all sub-graphs with two vertices, using indica- 
tor functions as vertices labels. Figure 9 depicts the operation 
of functionals that are based on another type of one-vertex 
sub-graph where different Gaussian functions of the observed 
intensities are employed as vertices' labels. The estimated 
deformation obtained using the constraints derived using all 
these subgraphs is given by: 
<j>(x,y) = (-0.73a;-1.252/-1.3a; 2 ,1.29a;-0.76j/™0.01a; 2 ). 




Fig. 7. The operator application to the template and observa- 
tion using all sub-graphs with one vertex. 



6. CONCLUSIONS 

In this paper we have considered the problem of finding the 
transformation relating a given observation on a planar ob- 




Fig. 9. The operator application on the template and observa- 
tion based on one vertex sub-graph with Gaussian weighted 
labels on its vertices. 



ject with some pre-chosen template of this object. The di- 
rect approach for estimating the transformation is to apply 
each of the deformations in the group to the template in a 
search for the deformed template that matches the observa- 
tion. The notion that a weighted graph represents the topol- 
ogy of images was presented. A method that employs a set 
of non-linear graph-based functionals to replace the original 
high dimensional problem by an equivalent linear problem, 
expressed in terms of the unknown transformation parame- 
ters, was derived. The resulting method is explicit and global. 
It deals with any elastic deformation, and the obtained map 
(p = H(h, /) is continuous and involves only elementary lin- 
ear analysis in the same dimension as that of the group model. 
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Abstract 

The paper is devoted to state estimation problems for nonlinear dynamic 
systems with system states being compact sets. The studies are motivated 
by the theory of dynamical systems with unknown but bounded uncer- 
tainty without its statistical description. The trajectory tubes of differ- 
ential inclusions are introduces as the set- valued analogies of the classical 
isolated trajectories of uncertain dynamical systems. Applying results 
related to discrete-time versions of the funnel equations and techniques 
of ellipsoidal estimation theory developed for linear control systems we 
present new approaches that allow to find the outer and inner estimates 
for such set-valued states of the uncertain nonlinear control system. Nu- 
merical simulations are also given. 

Key word: Differential inclusions; Uncertain dynamic system; State con- 
straints; Viability theory; Trajectory tube; Funnel equations; State estimation; 
Ellipsoidal approach. 

1 Introduction 

The topics of this paper come from the theory of dynamical control systems with 
unknown, but bounded uncertainties (the case of the so-called " set-membership" 
description of uncertainties) [3, 5, 11, 14, 15, 17, 18]. The motivations for these 
studies come from applied areas ranged from engineering problems in physics 
to economics as well as to ecological modelling. The paper presents recent 
results in the theory of tubes of solutions (trajectory tubes) to differential control 
systems modelled by nonlinear differential inclusions with uncertain parameters 
or functions. 
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We will start by introducing the following basic notations. Let R n be the 
n-dimensional Euclidean space and x'y be the usual inner product of x, y e R n 
with the prime as a transpose, with || x \\ — {x'x) 1 / 2 . Denote comp R n to be 
the variety of all compact subsets A C R n and conv R n to be the variety of all 
compact convex subsets A C R n . Let us denote the variety of all closed convex 
subsets A C R n by the symbol clconv R n . 

Consider the ordinary differential equation 

x = f(t,x,u) (1) 

with function / : T x R n x R n — > i? m measurable in £ and continuous in the 
other variables. Here x stands for the state space vector, t stands for time 
(t € T = [tQ,ti]) and u is a control or a disturbance. The variables u in (1) are 
assumed to be bounded 

ueQ(t,x) (2) 

where Q(t, x) is a set-valued map (Q : T x i?" — ► compi? m ) measurable in i and 
continuous in x. The given data allows to consider a set- valued function 

F(t,x) = \J{f(t,x,u) | ueQ(t,x)} (3) 

and further on, a differential inclusion [2, 4, 8] 

xeF(t,x) (4) 

that reflects the variety of all models of type (l)-(2). 

Let us assume that the initial condition to the system (1) (or to the differ- 
ential inclusion (4)) is unknown also but bounded 

a;(* ) = x°, x° eX° e compi?" (5) 

One of the principal points of interest of the theory of control under uncer- 
tainty conditions [14] is to study the set of all solutions x[t] = x(t,to,x°) to 
(l)-(5) (respectively, (4)-(5)) and furthermore the subset of those trajectories 
x[t] = x(t,t ,x°) that satisfy both (4)-(5) and a restriction on the state vector 
( the "viability" constraint [1]) 

x[s]€Y(s), s€[t ,t] (6) 

where Y(-) ( Y{t) G conv_R p ) is a convex compact valued multifunction. 

The viability constraint (6) may be induced by state constraints defined for 
a given plant model or by the so-called measurement equation [14] 

y(t) - G(t)x + w, (7) 

where y is the measurement, G(t) — a matrix function, w — the unknown but 
bounded " noise" and 

weQ(t), Q(t) e compR p . 
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The problem consists in describing the set X[-] = {#[•] = x(-,to,x )} of 
solutions to the system (4)- (5) (or to the system (4)- (6), that is the viable 
solution bundle or "viability bundle"). The point of special interest is to describe 
the t - cross-section X[t] of this set that is actually the attainability domain of 
system (1), (4), (5) at the moment t. The set X[t] may be considered also as the 
set- valued estimate of the unknown state x(t) of the system of relations (4), (5) 
and (6). This estimate X[t] as a set-valued function of t £ [to^i] is called the 
viability tube or the viable solution tube. The viability tubes were considered in 
some aspects in the theory of differential games [12, 13] and in nonlinear control 
synthesis problems [16]. 

The paper deals with the problems of control and state estimation for a dy- 
namical control system described by differential inclusions with unknown but 
bounded initial state. The solution to the differential system is studied through 
the techniques of trajectory tubes with their cross-sections X{t) being the reach- 
able sets at instant t to control system. 

Basing on the well-known results of ellipsoidal calculus developed for linear 
uncertain systems we present the modified state estimation approaches which 
use the special nonlinear structure of the control system and simplify calcu- 
lations. Examples and numerical results related to procedures of set-valued 
approximations of trajectory tubes and reachable sets are also presented. 

2 Preliminaries 

We assume that the notions of continuity and measurability of set- valued maps 
are taken in the sense of [4]. 

Consider the differential inclusion (4) , where x £ R n , F is a continuous mul- 
tivalued map (F : [to,t\] x R n — > convi?™) that satisfies the Lipschitz condition 
with constant L > 0, namely 

h(F(t,x),F(t,y)) <L\\x-y\\, Vx,y £ R n 

where h(A, B) is the Hausdorff distance for A,BC R n , i.e. 

h{A,B) =max {h + '(A, B) , h~ {A, B)} , 

with h + (A, B) , h~ (A, B) being the Hausdorff scmidistances between the sets 
A,B, 

h+(A, B) = sup{d(a;, B) \ x e A}, h~{A, B) = h+(B, A), 

d(x,A) =inf {|| x-y \\ | y £ A}. 

Assuming a set X £ compi?" to be given, denote x[t] — x(t 7 t ,x ) ( t £ 
T = [£o,*i] ) to be a solution to (4) (an isolated trajectory) that starts at point 
a; [to] = x e X . 

We take here the Caratheodory-type trajectory ir[-], i.e. as an absolutely 
continuous function x[t] (t £ T) that satisfies the inclusion 

j t x[t]=x[t]GF(t,x[t]) (8) 
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for almost every t E T. 

We require all the solutions { x[t] — x(t, to, x ) \ xo € X } to be extendable 
up to the instant t\ that is possible under some additional assumptions [8]. 

Let Y(t) be a continuous set- valued map (Y : T — > convi? n ), Xq C Y(to)- 

Definition 1 [1, 15] A trajectory x[t] = x(t,to,Xo) {xo € Xq, t e T) of 
the differential inclusion (8) is called viable on [to,r] if 

x[t] E Y(t) for all t E [to,r\. (9) 



We will assume that there exists at least one solution x*[t] — x*(t,to,Xo) of 

(8) ( together with a starting point x* [to] = Xq € Xo ) that satisfies condition 

(9) with r = t 1 . 

Let X(-,t ,Xo) be the set of all solutions to the inclusion (8) that emerge 
from X ( the "trajectory bundle"). Denote X[t] — X(t 1 to, Xo) to be its crossec- 
tion at instant t. 

The subset of X(-, to, X ) that consists of all solutions to (8) viable on [to, r] 
will be further denoted as X(-,r,t ,Xo) (the "viable trajectory bundle") with 
its s - crossections as X(s,T,t ,Xo), s e [io,' r ]- We introduce symbol X[t] for 
these crossections at instant r, namely 

X[t] - X(T,to,X ) = X(T,T,to,X ) 

It is known that both maps X(t,to, Xq), X(t,to, Xo), 

X :TxT x compiT -*■ compi? n , 

X :TxT x compiT -> compi? n , 
satisfy the semigroup property: 

X{t,T,X{T,t ,X )) = X(t,t ,X ), t <T<t<t!, 
X{t,T,X{T,t ,X )) = X{t,t ,X ), to<T<t<t U 

and therefore define the generalized dynamic systems with set-valued trajec- 
tories [3, 20, 15]. The multivalued functions X[t] and X[t] ( t € T) will be 
referred to as the trajectory tube and viable trajectory tube (or viability tube) 
respectively. They may be considered as the set- valued analogies of the classical 
isolated trajectories constructed now under uncertainty conditions. 

One of the approaches that we discuss here is related to the evolution equa- 
tion of the "funnel type" that describes the dynamics of set-valued "states". 
The basic assumptions on set-valued map F(t, x) for the following results to be 
true may be found in [15, 9]. 

Let us consider the "equation" 

lim a- x h( X [t + a], II (x + <rF(t, x)) ) = 0, t£T=[t ,t 1 ] (10) 

xex[t] 
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with "initial condition" 

X[to] = X . (11) 

We can observe that this equation is the formal analogy of the ordinary 
differential equation when mappings F(t, x) — {f(t, x)} and X[t] — {x[t]} (X = 
{xo}) are single -valued. 

Theorem 1 [19] The multifunction X[t] = X(t,to, X ) is the unique set-valued 
solution to the evolution equation (10)-(11). 

Other versions of funnel equation (10) may be considered by substituting 
the Hausdorff distance h for a semidistance h + [16]. The solution to the /in- 
versions of the evolution equation may be not unique and the "maximal" one 
(with respect to inclusion) is studied. 

Now let us consider the analogy of the funnel equation (10)-(11) but now for 
the viable trajectory tubes X[t] = X(t,t n ,X ): 

lim a' 1 h{X[t + a], II (x + aF(t,x))C]Y(t + a) ) = 0, t€T, (12) 
xex[t] 

X[t ]=X Q . (13) 

The following result was proved in [15, 9] under assumptions of different type 
concerning mappings F(t,x) and Y(t). 

Theorem 2 [9, 15, 16] The multivalued function X[t] = X(t,to,Xo) is the 
unique solution to the evolution equation (12)-(13). 

3 Problem Statement 

It should be noted that the exact description of reachable sets of a control 
system is a difficult problem even in the case of linear dynamics. The estimation 
theory and related algorithms basing on ideas of construction outer and inner 
set-valued estimates of reachable sets have been developed in [17, 5] for linear 
control systems. 

In this paper the modified state estimation approaches which use the special 
quadratic structure of nonlinearity of studied control system and use also the 
advantages of ellipsoidal calculus [17, 5] are presented. We develop here new 
ellipsoidal techniques related to constructing external and internal set-valued 
estimates of reachable sets and trajectory tubes of the nonlinear system. Some 
estimation algorithms basing on combination of discrete-time versions of evo- 
lution funnel equations and ellipsoidal calculus [17, 5] are given. Examples 
and numerical results related to procedures of set- valued approximations of tra- 
jectory tubes and reachable sets are also presented. The applications of the 
problems studied in this paper are in guaranteed state estimation for nonlinear 
systems with unknown but bounded errors and in nonlinear control theory. 
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The paper deals with the problems of control and state estimation for a 
dynamical control system 

x(t) = A(t)x(t) + f(x(t)) + G(t)u(t), (14) 

x e R n , t <t<T, 
with unknown but bounded initial condition 

x(t Q ) = x , x e X , X c R n , (15) 

u(t) e U, U C R m , for a.e. t e [t ,T\. (16) 

Here matrices A(t) and G(t) (of dimensions n x n and n x m, respectively) are 
assumed to be continuous on i 6 [to,? 1 ], Xo and U are compact and convex. 
The nonlinear n-vector function f(x) in (14) is assumed to be of quadratic type 

/(*) = (/i Or),..., / n (a0), 

fi(x) = x'BiX, i = l,...,n, (17) 

where B, is a constant n x n - matrix (i = 1, . . . , n). 

Consider the following differential inclusion [8] related to (14)— (16) 

x(t) eA(t)x(t) + f(x(t))+P(t), for a.e. te [t ,T\, (18) 

x(t ) = x e Xo, 

where P(£) = G(t)i7. 

Let absolutely continuous function x(i) = x(t 7 t 0} x ) be a solution to (18) 
with initial state xq satisfying (15). The differential system (14)— (16) (or equiv- 
alcntly, (18)) is studied here in the framework of the theory of uncertain dy- 
namical systems (differential inclusions) through the techniques of trajectory 
tubes 

X{-, t , X Q ) = {x(-) = x{; t , x ) I x G X Q } (19) 

of solutions to (14)-(16) with their t-cross-sections X(t) = X(t,to,Xo) being 
the reachable sets at instant t for control system (14)-(16). 

The problem consists in describing the set X(-) = U Xoe x {x(-) — x(-, t , £o)} 
of solutions to the differential inclusion (18) under constraint (6) (the viable 
trajectory tube). The point of special interest is to describe the t - cross- 
section X(t) of this map that is actually the attainability domain of this system 
at the instant t. 

Basing on results of ellipsoidal calculus ([17, 5]) developed for linear uncer- 
tain systems we present here the modified state estimation approaches which 
use the special structure of nonlinearity of studied control system (14)-(17) and 
combine advantages of estimating tools mentioned above. 
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4 External Estimates of Reachable Sets and Tra- 
jectory Tubes 

We denote as B(a,r) the ball in R n , B(a,r) = {x e R n : || x — a || < r}, I is the 
identity n x n-matrix. Denote by E(a,Q) the ellipsoid in R n , E(a,Q) = {x <G 
R n : (Q~ l (x — a), (x — a)) < 1} with center a € R n and symmetric positive 
definite n x n-matrix Q. For any n x n-matrix Q denote its track as Tr Q and 
its determinant as \Q\. 

4.1 Outer Ellipsoidal Bounds 

The approach presented here uses the techniques of ellipsoidal calculus devel- 
oped for linear control systems [5, 17]. It should be noted that external ellip- 
soidal approximations of trajectory tubes may be chosen in various ways and 
several minimization criteria are well-known. We consider here the ellipsoidal 
techniques related to construction of external estimates with minimal volume 
(details of this approach and motivations for linear control systems may be 
found in [5, 17]). 

Assume here that P(t) = E(a,Q) in (18), matrices Bi (i = l,...,n) are 
symmetric and positive definite, A(t) = A. We may assume that all trajectories 
of the system (18)-(15) belong to a bounded domain D = {x e R n :|| x \\< K} 
where the existence of such constant K > follows from classical theorems of 
the theory of differential equations and differential inclusions [8] . 

From the structure (17) of the function / we have two auxiliary results. 
Their proofs are based on the algebraic properties of quadratic forms and are 
omitted here. 

Lemma 1 The following estimate is true 



f(x)\\<N, N = K 2 C£ x2 i) 1 



/2 
i=l 



where Xi is the maximal eigenvalue for matrix Bi (i = 1, ...,n). 

Lemma 2 For all t £ [to,T] the inclusion 

X(t)cX*(t) 

holds where X*(-) is a trajectory tube of the linear differential inclusion 

x e Ax + B(c, ^N/2), x e X , (20) 

where c = {N/2, ..., N/2} e R n . 

The following theorem gives the external estimate of the trajectory tube 
X(i) of the differential inclusion (18). 
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Theorem 3 Let X = 5(0, r), r < K and 

r K-r 1 

u = mm{ \7M ; z ;T} - 

Then for all t G [to, t*\ the following inclusion is true 

X(t,t 0l X Q )cE(a+(t),Q + (t)) 1 (21) 

where 

M = KVX + N + P, P = XX + v ^, 

/ n \ V2 

2 I 



L = V\ + 2KlJ2 X 



with X, X{ and X being the maximal eigenvalues of matrices A A' , Bi (z = 1, ..., n) 
and Q respectively, and vector function a + (t) and matrix function Q + (t) satisfy 
the equations 

a + = Aa + + a + c, a + (t o )=0 (22) 

Q+ = AQ+ + Q+A T + qQ+ + q- l Q\ 

q = {n- 1 Tr((Q+)-iQ*)y/i, (23) 

Q + {to) = Qo = r 2 I. 

Here 

nN 2 
Q* = (p- 1 + l)Q+(p+l)Q, Q=—h (24) 

and p is the unique positive solution of the equation 

n . 
V^ n 

^P+oh ~Kp+i)' 

with <Xi > (i = 1, ...,n) being the roots of the following characteristic equation 

\Q-aQ\=0. (26) 

Proof. Applying Lemmas 1-2 and the ellipsoidal techniques [5, 17] and compar- 
ing the inclusions (18) and (20) we come to the relation (21). 
Example 1. Consider the following control system 

Xl " 6X 2 1+U 2 l1 0<t<T, (27) 

x 2 = x( + x£ + u 2 , 

X a = 5(0, 1), P(t) = 5(0, 1), T = 0.15, K = 2.6. (28) 

Results of computer simulations based on the above theorem for this system are 
given at Fig. 1. 
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^E(a*(t), Q*(t))' 

0.05 _ 15 -1 -0.5 



0.5 1 

X1 



1.5 2 



Figure 1: Reachable sets X(t,to,Xo) and their outer ellipsoidal estimates 
E(a+(t),Q+(t)) (here t* = v^/29.2). 



4.2 Upper Ellipsoidal Bounds via Funnel Equations 

Let us discuss the estimation approach based on techniques of evolution funnel 
equations. Consider the following system 



x = Ax + f(x)d, xq e Xq, to < t < T, 



(29) 



where x G R n , \\x\\ < K, d is a given n-vector and a scalar function f(x) has a 
form /(x) = x' Bx with a symmetric and positive definite matrix B. 

Note that the direct application of funnel equations for finding trajectory 
tubes X(t) is very difficult because it takes a huge amount of computations 
based on grid techniques. The following theorem related to our special case 
of nonlinearity presents an easy computational tool to find estimates of X(t) 
by step- by-step procedures. For a simpler case of system nonlincarities the 
approach was presented in [10]. 

Theorem 4 Let X = E(a, PS -1 ) with k ^ 0. Then for all a > the follow- 
ing inclusion holds 



X(t + a, t Q , Xo) C E(a(cr),Q((T)) + o(a)B(0, 1), 



where 



a(o~) = a + a(Aa + a! Ba ■ d + k d), 
Q(a) = k 2 (I + o-R)B- x {I + aR) 1 , R = A + 2da B 
and linio-^+o cr _1 o(cr) = 0. 



(30) 

(31) 
(32) 
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Proof. The funnel equation for (29) is 

lim cr^ 1 h(X(t + (T 7 t 07 X ), | j {x + a(Ax+ 

cr^ + O ^ 

x£X(t,t ,X ) 

+ ~f{x)d)} = 0, t € [t ,T], X(t ,to,Xo) = X . (33) 

If x € dX where dX means the boundary of X , we have 

f(x ) = k 2 + 2a' Bx - a'Ba 

and from (33) we have also 

(J {{I + aA)x + af{x )d}= (J {(I + aR)x a + a(k 2 - a'Ba)d}. (34) 

x €dX xo£3Xo 

Note that if the ellipsoid in (39) gives the tube estimate for the system with 
dXo as starting set, then also for the system with Xq as starting set. Applying 
Theorem 1 and taking into account the equality (5) and the above remark we 
come to the estimate (39). 

We may formulate now the following scheme that gives the external estimate 
of trajectory tube X(i) of the system (29) with given accuracy. 

Algorithm 1. Subdivide the time segment [ioj^l into subsegments [t», t*+i] 
where ij = to + ih (i = 1, . . . , m), h = (T — to)/m, t m = T . 

• Given X = E(a, k^B^ 1 ) with k ^ 0, define X x = E(ai, Q\) from Theo- 
rem 4 for oi = a(a), Q± = Q{cr), o = h. 

• Find the smallest constant k\ such that 

£(oi,Qi)cXi = £(oi,fc?B- 1 ), 

and it is not difficult to prove that k\ is the maximal eigenvalue of the 
matrix B 1 / 2 Q 1 B 1 / 2 . 

• Consider the system on the next subsegment [ti, t?\ with E(a±, kfB -1 ) as 
the initial ellipsoid at instant t\. 

• Next steps continue iterations 1-3. At the end of the process we will get 
the external estimate E(a(t),Q(t)) of the tube X(t) with accuracy tending 
to zero when m —>■ oo. 

Consider the estimation of the viable trajectory tube X(t) of the system (29) 
under constraint (6). We modify Algorithm 1 taking into account the viability 
constraint (6) where we take Y(t) = Y = E(yo,D). 

In this case from Theorems 2-4 we have the main inclusion 

X(to + a,t ,X Q )CE(a(a),Q(a))nE(y ,D) + 
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+ o(a)B(0,l), X = E(a,k 2 B~ 1 ), (35) 

which allows to formulate the modified algorithm which is more complicated 
now (all notations in (35) are taken from Theorem 4). 

Algorithm 2. Subdivide the time segment [ioj^l into subsegments [i,,£j+i] 
where t{ = to + ih (i = 1, . . . , m), h = (T — to)/m, t m = T. 

• Given X = £(a, fcgB- 1 ) with k Q ^ 0, define X 1 = E(ai,Qi) from (35) 
(as in Algorithm 1) for a\ = a(a), Q\ — Q(cr), a = h. 

• Consider the intersection of ellipsoids X\ = E(ai,Qi) and Y(t) = Y = 
E(jjq,D) and find the smallest (with respect to some criterion, e.g. as in 
[5]) ellipsoid X{ = E(al,Ql) such that 

E(ai,Qi)nE(yo,D)cE(al,QT). 

• Find the smallest constant k\ such that 

E(a* l7 Ql) C Xi = Eia^klB- 1 ), 
k\ is the maximal eigenvalue of the matrix B 1 / 2 Q\B 1 / 2 . 

• Consider the system on the next interval [t l3 1 2 ] with E(al, k^B" 1 ) as the 
initial ellipsoid taken at initial instant t\. 

• Next steps continue iterations 1-4. At the end of the process we will get the 
external estimating tube E(a*(t),Q*(t)) of the tube X(i) with accuracy 
tending to zero when m — > oo. 

Example 2. Consider the following system 

Xi = -Xi, 

1 ,x\ 2s (36) 

xi = ^x 2 + 3(—+x 2 ), 

X a = E(0,Q o ), Qo=(q J V (37) 

Here d — {0,3}, f{x) = x'Bx with B — Qq 1 , T = 0.15. Results of computer 
simulations based on Theorem 4 are shown at Fig. 2. 

Assume now that the state constraint (6) is also present for the system (36) 
withF = B(0,r), r = 2.4. 

Applying Algorithm 2 we discover that the viability constraint becomes im- 
portant in estimation only after 10th iteration so we may use there the simpler 
Algorithm 1. After that beginning with the 11th iteration the whole four-steps 
procedure works. Fig. 3-4 illustrate this estimation process. 
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Q(t» 




Figure 2: Trajectory tube X(t,t ,X a ) and its external ellipsoidal tube 
E(a(t),Q(t))- 



5 Internal Ellipsoidal Estimates 

Consider now the internal set- valued estimates of reachable sets X(i) of the 
uncertain nonlinear system (29). As in the previous section we formulate first 
the following auxiliary result. 

Theorem 5 Let Xq = E(a,k 2 B^ 1 ) with k ^ 0. Then for the trajectory tube 
X(t) of the system (29) and for all a > the following inclusion holds 

E(a-{a),Q-{a))CX(t + a)+o{a)B(0,l), lim ct -1 o(ct) = 0, (38) 

a— >+0 



where 

a~{o) = a(o~) + era, 

Q-(o-) = Q(a) + a 2 Q + 2aQ(o-) 1 / 2 (Q(a)- l / 2 QQ(a)- 1 / 2 ) 1 / 2 Q(a) 1 / 2 , 
and a(o~), Q(u) are defined in Theorem 4- 



(39) 



Proof. The proof of this result is similar to the proof of Theorem 3 [10]. 
Based on this result we may formulate the following scheme that gives the 
internal estimate of trajectory tube X(i) of the system (29). 

Algorithm 3. Subdivide the time segment [to, T] into subsegments [tj, i,+i] 
where ij = to + ih (i = 1, . . . , m), h = (T — to)/m, t m = T . 

• Given Xq = E(a, k 2 B~ x ) with fco ^ 0, define X\ = E{a\, Q{) from Theo- 
rem 5 for ai = a~(er), Q\ = Q~(o~), a = h. 
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Figure 3: Viable trajectory tube X(t, to,Xo) and its external ellipsoidal tube 
E(a*(t),Q*(t)). 



• Define k\ as the minimal eigenvalue of the matrix B^-^QiB 1 / 2 . 

• Consider the system on the next subsegment [t\, t 2 ] with E(a\, kfB -1 ) as 
the initial ellipsoid at instant t\. 

• Repeat consequently the steps, at the end of the process we will get the 
internal estimate E(a(t), Q(t)) of the tube X(i) with accuracy tending to 
zero when m — > 00. 

Note that fci defined at second step of the algorithm is the largest positive 
constant such that E(a\,kiB~ 1 ) c E(a\,Q\). 
Example 3. Consider the following system 



X\ = X\ 
X2 = 



x\ + : 
x 2 + u 2 , 



Ml, 



< t < T. 



(40) 



Here t = 0, T = 0.3, h = 0.025, a = a = (0,0), Q = Q = I. Results of 
computer simulations based on Theorem 5 are shown at Fig. 5. 

6 Conclusions 



The paper deals with the problems of control and state estimation for a dy- 
namical control system described by differential inclusions with unknown but 
bounded initial state. 

The solution to the differential system is studied through the techniques 
of trajectory tubes with their cross-sections X(t) being the reachable sets at 
instant t to control system. 
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Figure 4: Steps 1-4 of 11th iteration of Algorithm 2. 



Basing on the results of ellipsoidal calculus developed for linear uncertain 
systems we present the modified state estimation approaches which use the 
special nonlinear structure of the control system and simplify calculations. 

Examples and numerical results related to procedures of set-valued approx- 
imations of trajectory tubes and reachable sets were also presented. 
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Abstract 

In this paper, two new approximate formulae for fractional derivatives are de- 
veloped by means of Sine methods. The difference of the two formulae is the 
variable transformations incorporated; the single exponential transformation and 
the double exponential transformation. We give error analysis of the formulae, and 
show that these formulae archive exponential convergence. Numerical examples 
that confirm the analysis are also given. 

Keywords: fractional derivative, numerical approximation, Sine methods 

1 Introduction 

In the last few decades, mathematical models with fractional derivatives have been 
used in the fields of physics [6], engineering [16], chemistry [14], biology [9], control 
theory [3], and many others [1,2,7]. We consider two types of derivatives of order p: 
Riemann-Liouville type (Y) p a f) and Caputo's type (D^ /), which are defined by 

/ H \LpJ+i 
KM(t) = [j) [li pS - p+l f](tl t>a, (1) 



V P [fW = t Pi - p+1 



d UPHi 



*• f 



(0, t > a, (2) 
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1 INTRODUCTION 



respectively, where \ q a f is the Riemann-Liouville fractional integral of order q, 

WKO = ^r~ ry— , t > a. (3) 

In what follows, we assume p, q € (0, 1). In this case, approximating fractional deriva- 
tives with high accuracy is not an easy task, because there is a weakly singular kernel 
called the Abel kernel in (3). Typical numerical methods for fractional derivatives in 
the literature are reviewed in some books cited above and some papers [4,5]. The con- 
vergence rates of those methods are all of polynomial: 0(n~ r ), where n denotes the 
number of evaluation of /, and y is a positive constant. 

Recently, an "exponentially" converging approximate formula based on Cheby- 
shev polynomials has been proposed by Sugiura-Hasegawa [19]. In their beautiful 
work, they have extended the so-called Clenshaw-Curtis rule for the definite integral 
J f(s)ds to the fractional derivative of Caputo's type (2), and also pointed out that 
the formula is also applicable to the Riemann-Liouville type (1) through the relation 
D«[/](0 = D«[/](0 + f(a)(t - a)-P/T(l - p). They have shown that the formula con- 
verges uniformly on the given interval [a, b] with the exponential rate, 0(e~ y "), under 
the assumption that / is analytic on an elliptic domain that contains the interval [a, b]. 
In general, however, / does not satisfy this assumption. In fact, the solution of frac- 
tional differential equations may have a singularity at the endpoint, t — a, due to the 
Abel kernel [8]. In such cases, their formula loses the fast convergence. 

On the other hand, for such singular functions, it is known in the wide range of nu- 
merical analysis that Sine methods are quite effective (see, for example, Stenger [17]). 
In fact, Riley [15] employed techniques in Sine methods to approximate integrals of 
the form (3), and obtained exponential convergence, 0(e~ y *"), despite singularities in 
the kernel and the function /. This result has then been extended by Mori et al. [10] 
and the present authors [13], and it turned out that the convergence rate of the method 
can be improved to 0(e~ r "/ log "). The key in this improvement is the replacement of 
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the variable transformation; the standard Single Exponential (SE) transformation em- 
ployed in Riley's method was replaced with a stronger transformation, the so-called 
Double Exponential (DE) transformation [11, 18]. The latter methods, i.e. the Sine 
methods incorporated with the DE transformation, are called DE-Sinc methods, while 
the former ones are referred to SE-Sinc methods, accordingly. 

As a natural extension of these results, in the present paper we propose two new ap- 
proximate formulae for Caputo's fractional derivative (2); either based on the SE-Sinc 
and DE-Sinc methods. It is then shown theoretically and numerically that the con- 
vergence rate is 0(e~ y ^") in the first formula, and 0(e~ r "/ log ") in the second formula. 
These formulae are also applicable to the Riemann-Liouville fractional derivative (1) 
in the same manner as in Sugiura-Hasegawa [19]. 

This paper is organized as follows. The main results are stated in Section 2. In 
Section 3, we show numerical examples of the new formulae, and compare them with 
the one by Sugiura-Hasegawa. The proofs of the main theorems are given in Section 4. 

2 Approximate formulae and their error analysis 

The main tool to derive approximate formulae is the Sine approximation: 

N 

F(t) * J] F(jh)S(j,h)( T ), t e R, (4) 

J'-N 

where S (j, K){t) is the Sine function defined by S (j, K){t) - s,m{n(T/h-j)}/{n(T/h-j)}. 
The so-called Sine quadrature rule is derived by integrating the both sides of (4): 

Xoo N „co N 

F(r) dr * Yj F W S C/. *)(t) dT = h Yj F(jh) - (5) 

j=-N ^-°° j=-N 

Note that the variable r in these formulae moves on the whole real line. If the function 
to be approximated is defined on a finite domain, variable transformation should be 
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employed in (4) or (5). There are two transformations, the SE transformation and the 
DE transformation, which are defined by 

-a . /t\ b + a 



t = O t ) = — tanh (2 sinh(T) ) + ~ir- 



Both transformations map rel onto t e (a, b). Their inverse functions are: 



r-(«y"w -*(£). 



r = (*S,r'W = log 



I 10g (^) + J 1+ {I 10g (-) 



2.1 Derivation of a formula by means of the SE-Sinc methods 

Recall that Caputo's fractional derivative is defined by Df[/](0 = I fl p [/'](/)• Our 
basic idea is to approximate the integral part (\ a p ) based on the idea in Riley [15], 
and the derivative part (4) based on the idea in Stenger [17], respectively. Finally we 
combine them to approximate the target: D p /. 

First we consider the approximation of \ a p g for a given function g. Changing the 
original integral interval (a, f) to R by the variable transformation s - i/rf,(<x), we have 

i_ Pr .. . _ C™ g(riM)WI,,Y^)dcT _ (t-a?-P p g(^(^))dcr 



K P [g](t) 



J-co r(i - P )(t - riA^)) p r(i - p) lee (i + e- 



r(i - p)(f - ^» (<r)y r(i - p) J_„ (i + e -)(i + ^y- p ' 



Note that the weakly singular integrand (the Abel kernel) is translated to a smooth 
function. Applying the quadrature rule (5) to the translated integral, we obtain the 
approximate formula for the integral part: 
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Here Ms a mesh size suitably chosen depending on N, which will be described later. 

Next we consider the approximation of /', Let us define a function Q a j, as Q a ,bit) = 
(t - a)(b - t). Putting F(r) = /(^(r))/e a , fe (^(r)) in (4), we have 

which is equivalent to: 

*L f(tff s \(jh)) 
fit) * C s *[f](t) = 2a o (d, s * ( m Qa ' h(t)S °' ^ { «)> f 6 (a, *). (7) 

Differentiating the both sides gives an approximate formula for /', i.e. /' « {C^/}'. 
Using this and (6) with g = /', we finally obtain the desired formula as follows: 

d£[/](0 = i^"[/'](0 * J"[/'](0 * J"[{C^/)'](0- (8) 

2.2 Derivation of a formula by means of the DE-Sinc methods 

We consider the use of the DE transformation instead of the SE transformation here. 
For the integral part \ a p g, we apply s - ^™(cr), then \ a p g is translated into 



la P [g](t) 



■ ■-/■■...,»- (fzftl r "COSh(cr)g(^(o-))dcr 

E(l - p) J-co (1 +e- IIsinh ( cr ))(l +e Itsinh ( £r )) 1 -/'' 



Applying the quadrature rule (5) to this integral, we obtain the approximate formula: 

*a LgJW ~ J W l£JW r( -j - p) 2-1 (1 + e -Jisinh(Mi))(l + e Jtsinh(tt)-)l-p ' 

The derivative part can be handled in the same manner. Similar to (7), using 

fit) * c^ifm = v - ;* . .,.. Q a , b (t)s u, h)i{r: b r\t)), t <= ( a , », 

jtr N Qa,bW a ,bij h >> 
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and differentiating both sides, we have /' « {C?f/}'. Then we obtain the formula: 



D£[/]» = i]r p [fi(t) * rjifiit) * j^ e [{c™/)'](0- 



(9) 



2.3 Results of error analysis 

We here state the error analysis results of the presented approximate formulae, while 
their proofs are left to Section 4. Let us introduce the following function space. 

Definition 1. Let 3> be a simply-connected domain which satisfies (a, b) c '3, and 
let a be a positive constant. Then L ff (fF) denotes the family of all functions / that are 
analytic on @, and satisfy |/(z)| < C\Q a , (z)\ for a positive constant C and all ze®. 

In the statement of theorems below, @ is either tfr^ b (@d) or if/^ b (!0d), where 



0^) = {z eC: 



arg 



<rf , 






sMK) 



<<i 



These are domains that are mapped by the SE or DE transformation from a strip domain 



9 d = [(eC:\Im{\<d\, 



(10) 



for a positive constant d. With these notations, the approximate errors of the formula (8) 
and (9) are analyzed as follows. 

Theorem 1. Let (f/Q a ,b) 6 L a (if/f b (@ d )) for d with < d < n. Let fi = min{l - 
/?, a], N be a positive integer, and /i be selected by h — yJnd/(jiN). Then there exists a 
constant C independent of N such that 



max pZlfKf) - l%[{C%m(f)\ < CAfe-V**". 

t£[a,b] 



(ID 



476 APPROXIMATE FORMULAE FOR FRACTIONAL DERIVATIVES 



Tomoaki Okayama et al. 

Theorem 2. Let (f/Q a ,b) e L ff (^™ {3> d )) for d with < d < Jt/2, and let / be 
analytic at fo. Let /i = min{ 1 — p, a], Nbea positive integer with N > fi/(2d), and h be 
selected by h — log(2dN/[i)/N. Then there exists a constant C independent of iV such 
that 

max \D p a [f](t)-r*[{C™f)'](t)\ < C - -£—. e**»n*WH>\ 

The number of evaluation of / in these approximation formulae is n — 2N + 1, 
which means the convergence rate is 0(e~ r, «") in the SE-Sinc case, and 0(e~ y "/ log ") in 
the DE-Sinc case for some y > 0; in both cases the errors decay exponentially. 

Remark 1. The assumption (j/Q a ,b) e L a (£^) may seem to be not practical since 
the function / must be zero at the endpoints by the condition \f(z)/Q a ,b(z)\ < C\Q a , (z)\. 
But actually, functions in a certain wider, and reasonable space can be translated to 
those satisfying the assumption (see Stenger [17, §4]). 

3 Numerical examples 

In this section we consider two test functions, f\{f) - f 4 ^ 3 (l - f) 2 /F(7/3) and /2(f) = 
/ 2 (1 — /) 2 e', and their 1/2-order derivatives in Caputo's sense on the interval (0, 1): 



d; /2 [/ 2 ](o = 1 



r(ll/6) 187 

,1/2 



1X3/2) 



(8f 3 - 4f 2 - 22t + 31) + e'erf( Vf) {8f(2f 3 - It + 8) - 3l] 



Let Jt m denote an arbitrary positive number less than jt. Then the function f\ satis- 
fies (/i/2o,i) 6 L 1/3 (^ 1 (^J) and (/i/6o,i) 6 U/s^&x.ji)), and the function 
/ 2 satisfies (/ 2 /2o,i) 6 Li(0» (0„J) and (/ 2 /2 ,i) 6 L^™^^)). In actual com- 
putations, we set Jt m = 3.14, and then h can be selected according to Theorem 1 or 
Theorem 2. 

1/2 1/2 

The numerical result of D„' f\ is shown in Fig. 1, and the one of D ' / 2 is shown 
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1 

0.01 

0.0001 

1e-06 

1e-08 

19-10 
19-12 
19-14 

19-16 




Chebyshev formula 
SE-Sinc formula 
DE-Sinc formula 
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n=2N+\ 




10 20 30 40 50 60 70 80 
n=2N+l 



,1/2 



1/2 



Fig. 1. Approximation errors of D Q ' f\. Fig. 2. Approximation errors of D Q ' fa 



in Fig. 2. Both of the computation programs are written in C with double-precision 
floating-point arithmetic. The errors are checked on t — 0.01, 0.02, . . . , 0.99, and the 
maximum error of them is plotted on the graphs. There are three plot lines in both 
graphs; the formula by Sugiura-Hasegawa [19] (dashed line with x points), by the SE- 
Sinc methods (solid line with A points), and by the DE-Sinc methods (solid line with 
O points). The convergence profiles of the Chebyshev formula are different between 
Fig. 1 and Fig. 2. This should be caused by the singularity of the function f\ at the 
endpoint, t — 0. In contrast, we can see that the results of the SE-Sinc formula and the 
DE-Sinc formula are consistent with Theorem 1 or Theorem 2 in both graphs. 

4 Proofs of the theorems in Section 2 
4.1 Proof of Theorem 1 (the SE-Sinc case) 

The following two theorems are critical to prove Theorem 1. 

Theorem 3. Let the assumptions of Theorem 1 are fulfilled. Then there exists a 
constant C independent of N such that 



max \l l a - p [f'](t)-r*[f'](t) <Ce" 



jTzdfiN 
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Theorem 4 (Stenger [17, Corollary of Theorem 4.4.2]). Let the assumptions of 
Theorem 1 are fulfilled. Then there exists a constant C independent of N such that 



sup 

te(a, b) 



d {f(t)-C s *[f](t)} 



dr 



<CNe~ 



JjidfiN 



Using these theorems and the trivial fact sup^ ||J^||c([a,fe]) < °°, we get (11). In 
what follows, we prove Theorem 3. The next theorem is the base of the error analysis. 

Theorem 5 (Stenger [17, Theorem 4.2.6]). Let (FQ ai> ) e L,p(if/f b (@ d )) for d with 
< d < n, let N be a positive integer, and h be selected by h — yj2nd/(J3N). Then 
there exists a constant C independent of N such that 



r b N 

F(s) ds-hj] F(^ b (kh)){^ b Y(kh) 

Ja k=-N 



< Q & - ^jlndpN 



Let us apply this theorem to the approximation (6). If we put F(s) - g(s)/(t-s) p in 
this theorem, and if g is analytic and bounded uniformly on t/rf,(i^/) for all t € [a, b], 
then {FQ aJ ) e Li_ p ($^(^)). Furthermore if we set/i = minjl - p, a], then (FQ a , r ) e 
L^tf/fX^d)) since clearly L v (i/^(^)) c L p (i//f t (S> d )) if v > p. Therefore we obtain 
the next result. 

Lemma 1. Assume that there exists a constant d with < d < Ji such that g is 
analytic and bounded uniformly on i/rf r (f^) for all t e [a, b]. Let fi = min{l - p, a}, 
N be a positive integer, and h be selected by h — yjlnd/QiN). Then there exists a 
constant C independent of Af such that 



max \l a - p [g](t) - I%[g](t)\ < Ce- V^. (12) 



We can relax the condition on g using the following lemma. 

Lemma 2. Let g be analytic and bounded on i/r^(f^/) for d with < d < n. Then 
is analytic and bounded uniformly on i/rf ( (f^) for all t e [a, b]. 
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Proof. We shall establish this lemma if we prove for all t e [a, b] that i/rf,(.S^) c 
if/f b (3> d ), which is equivalent to "|Im{^ f } _1 (z)| < d => |Im{^} _1 (z)l < d" (recall 
that ® d is defined by (10)). Set £(t) - {ipf t )~\z) for simplicity. It is sufficient to 
show that |Im£(f)| is a monotonically decreasing function, since from this we have 
| lm({b)\ < | Im£(f)| < d. Let x, y e R and set z - x + iy. Then Imf(f) is expressed as 



(z — a\ 
-— 1 = arg 



ax + tx - at - x 2 - y 2 (t — a)y 



(t - x) 2 +y 2 (t- x) 2 + y z 

Considering cos(Im£(f)) and its derivative, we have 



ax + tx - at - x 2 - y 2 
cos(lmf(0) = 



^(ax + tx-at - x 2 - y 2 ) 2 + (t - d) 2 y 2 
d n „.. (t-a)((a-x) 2 +y 2 )y 2 

dt C ° S < Im ^) = {((a _ x) 2 +y 2 m _ x) 2 +y 2 )} y2 * 0. 



Thus cos(Im £(f)) is a monotonically increasing function. Since -ji < Im £ (t) < ji and 
cos(- Im f (t)) - cos(Im f (t)), we can see that | Im £{i)\ is monotonically decreasing. ■ 

Therefore Lemma 1 can be rewritten as follows. 

Lemma 3. Let g be analytic and bounded on if/ a E b (&d) for d with < d < ji. Let 



H — min{ 1 — p, a), N be a positive integer, and h be selected by h — yj27td/(jxN). Then 
there exists a constant C independent of N such that (12) holds. 

If {f IQ a ,b) £ L Q ,(i/f^ E & (^)) (assumption in Theorem 1) holds, then /' is analytic 
and bounded on ^ E , (fS^_ e ) for any e with < e < <f. Choosing e = d/2 and using 
Lemma 3, we obtain Theorem 3. 

4.2 Proof of Theorem 2 (the DE-Sinc case) 

Since sup^ ||I~™||c([a,/>]) < °°, Theorem 2 can be proved in a similar way to the SE-Sinc 
case, by showing the following two theorems. 
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Theorem 6. Let the assumptions of Theorem 2 are fulfilled. Then there exists a 
constant C independent of N such that 



max rt/']« - r$[f](fi\ < Ce-*< w/log(MiV/ '' ) . 

te[a, b]i ' ! 



Theorem 7. Let the assumptions of Theorem 2 are fulfilled. Then there exists a 
constant C independent of N such that 



sup 

t€(a,b) 



j t {f(t)-C™[f](t)} 



<c 



N 



log(2dN/n) 



-KdN/ log(2dN/fi) 



We first give the proof of Theorem 7, which is relatively short. 



4.2.1 Proof of Theorem 7 (approximation error of derivatives) 

We easily obtain that 



dt 



{f(t)-C%[f](t)} 



~ m v \{jh)) 



d 



\j\>N 



J=-o 



Q a ,b(r; b (jh)) 



At 



{Qa, b (t)sah)({^ h r i (t))) 



(13) 



Let us examine the first term. We need the following definition for it. 

Definition 2. Let %{e) be defined for < e < 1 by %{e) = {( e C : | Re f | < 
1/e, |Im£| < d{\ - e)}. Then H \S>d) denotes the family of all functions F that are 
analytic on S'j, and such that N\(F,d) - lim^o 4e> I^X^IIdfl < °°- 

Then the next assertion holds for any conformal map \jj that satisfies if/(K) = {a, b). 

Theorem 8 (Stenger [17, part of Theorem 4.4.2]). Assume the next two conditions: 

(Al) M{-))IQ a ,m-)) 6 H'(^), 



(A2) sup 

t€(a,b),-n/h<s<n/h 



df 



(Qa,b(tW Sr ' < ' ) ) 



< C/h with C depending only on ip and Q a j,. 
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Then there exists a constant C, depending only on ift, Q, d and /, such that 



sup 

t€(a,b) 



At 



™-z 



mm 

QajbMJh)) 



Qa,b{t)S{j,h){r l {t))\ 



<c e - 



-ndjh 



h 



We show that (Al) and (A2) are fulfilled with ifr(t) = ^™ h {t) under the assumption 
that (f/Qaj,) e L a (0£(^)). For (Al), it is sufficient to prove M(Q^(^(0),4) is 
finite, since \f(z)/Q a ,b(z)\ < C\Q a ah {z)\ holds by the assumption (recall Definition 1). 
The next lemma shows the desired claim. 

Lemma 4 (Okayama et al. [12, Lemma 4.6]). Let a and d be positive constants. 
Then Ni(Q a ab (^ E b (-)), d) is finite for any d e (0, jt/2). 

Using the Leibniz rule and the following inequality: 



QaJbif) 



(t - a)(b - t) 



b — a 



(O'ttO-HO) 



n(t - a)(b - t) I [1, It - a' 



b - a 



tt l0g (^) 



we easily show the condition (A2). 

Lemma 5. The condition (A2) in Theorem 8 holds with ip(t) = ifs° E b (t). 

Therefore we can use Theorem 8 to evaluate the first term in (13) as follows. 

Lemma 6. Let (f/Q a ,b) e L a (^™ (f^)) for d with < d < jt/2. Then there exists 
a constant C independent of h such that 



sup 

te(a,b) 



d « m DE ,(jti)) 

/(o- £ / .:L; ,1 ft^(05 a /oao -1 (0) 1 



At 



Qaj,WZ,Uh)r 



-nd/h 



<c 



There remains to evaluate the second term in (13); this is done by the next lemma. 



Lemma 7. Let (f/Qaj,) e L a (^™ {2> d )) for <f with < d < Jt/2. Then there exists 
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a constant C independent of h and N such that 



SU P Z 



ie(ti 



' \j\>N 



fW&M) 



Qa,b(K E b Uh)) 



At 



[Q a ,b(t)S(j,h)({^ b r l (t))} 



* C ]^M^ aeW(m - W 



Proof. First, by the identity 



Qa,b{t)S(hh){W°a h r\t)) = h 3^1 

Zk 



Xn/h 
n/A 



'ds 



and Lemma 5, it follows that for a constant C\ 



sup 

te(a, b) 



At 



{Qa,b(t)S(j,h)({^ h }-\t))} 



<Ci/h. 



(15) 



Second, by the assumption (j/Q a ,b) e L ff (t^ E , (£^)), there exists a constant C such that 



.m D : h (jh)) 



Qa,b(€VM 



^ C\Q a a MlUh))\ 



C{(b - a)/!} 



2d 



,2a 



cosh Zff (jisinh(;/z)/2) 



< C(fe - dfs 



2a -ita sinh(|j'A|) 



Furthermore using sinh(|y7z|) > (e'^' - l)/2, and putting C2 = C(b - a) 2a t 2a , we have 



z 

\j\>N 



M7 h (jh)) 



QaJ,(!f<%,Uh)) 



< C 2 2 e" fff 
= 2C 2 £< 

,/>A' 

<2C 2 I e -? aexp(s/!) d5 



exp(\jh\) 



faexpO'/i) 



<2C 2 



4C 



nahe Nh 



( jcff/ze 1 



I 2 



e -faexp(.s/i) j 



nahe Nh 



2 n~\ a exp(Nh) 



(16) 



Combining (15) with (16), we get (14). 



Theorem 7 is then established by taking /ias/i = log(2dN//j.)/N in Lemma 6 and 
Lemma 7. 
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4.2.2 Proof of Theorem 6 (approximation error of integrals) 

Theorem 6 can be shown in almost the same manner as the SE-Sinc case (Theorem 3). 
Let us start with the next theorem. 

Theorem 9 (Tanaka et al. [20, Theorem 3.1]). Let (//&,,/,) e Lg(t/^(^)) for d 
with < d < jt/2. Let N be a positive integer with N > j3/(4d), and let h be selected 
by h — log(4dN//3)/N. Then there exists a constant C independent of N such that 



F(s)ds - h J] F(<fi* b (khM^ b Y(kh) 

Ja k=-N 



< Ce -2jidN/log(4dN/fi) 



Applying this theorem to the approximation \ a p g « I*Sg, we have the next lemma. 

Lemma 8. Assume that there exists a constant d with < d < jt/2 such that g is 
analytic and bounded uniformly on (fr™(^) f° r a ^ f e t a ' ^]- Let // = min{ 1 — p, a}, N 
be a positive integer with Af > fi/(4d), and h be selected by h — log(4dN//j.)/N. Then 
there exists a constant C independent of Af such that 

max ll^felOO - i^ E [g](f)| < Ce- 2RdN/ios(4dNM . (17) 

K[a, b]' I 

We can relax the condition on g using the following lemma. 

Lemma 9. Let g be analytic and bounded on \//°\{Sid) u {b} for <i with < d < Jt/2. 
Then g is analytic and bounded uniformly on i/r™(^) f° r a ll f e t a > &]• 

Since its proof is far more complicated than the SE-Sinc case (Lemma 2), we leave 
it to the end of this section. If we accept this lemma, Lemma 8 can be rewritten as 
follows. 

Lemma 10. Let g be analytic and bounded on tff DE b (^ d )U{b} for d with < d < jt/2. 
Let /i = minjl - p, a}, N be a positive integer with Af > /x/(4d), and h be selected by 
h — log(4dN//j.)/N. Then there exists a constant C independent of Af such that (17) 
holds. 



484 APPROXIMATE FORMULAE FOR FRACTIONAL DERIVATIVES 



Tomoaki Okayama et al. 

If W/Qojb) 6 L„(0^(^)) holds, then /' is analytic and bounded on ^ b (^ d - e ) for 
any e with < e < d. Choosing e - d/2 and using Lemma 10, we obtain Theorem 6. 

It remains to prove Lemma 9. We commence by showing the following lemma. 
Here f* denotes a conjugate complex number of f , and sgn denotes the so-called sign 
function, defined by 

1 (x > 0), 

sgn(x) = \ o (x = 0), 

-1 O < 0). 

Lemma 11. Let us define 9^ and £^ rf as ^+ = {£ € ^ : Imf > 0} and <2>- - 
{£ € £>d '■ Im £ < 0), where ^ is defined by (10). Let / be a continuous function that 
satisfies /(£*) = {/(f)}* on Stj. Then the next two assertions are equivalent: 

(a) The function / satisfies that (al) Im{/(f )} = if and only if Im f = on S>a, and 
(a2) there exists f e ^+ such that Im{/(f )} > 0. 

(b) The function / satisfies sgn[Im{/(f)}] = sgn[Imf] for all ( e ^, i.e. 

Im{/(£)} > for all f e &%, (18) 

Im{/(£)} = for all f e R, (19) 

Im{/(£)} < for all f e ^J. (20) 

Proof. We show only (a) => (b) since clearly (b) => (a) holds. The second condi- 
tion (19) is obvious by the assumption (al). 

Suppose (18) does not hold, i.e. there exists t]q € 2>^ such that Imj/X^o)} < 0. 
Let *€ be a closed arc defined by ^ = {£ = /Ifo + (1 - /t)/7o : /I e [0, 1]}, where fo 
is given in the assumption (a2). Since Im / is continuous on % ', there exists ( e f 
such that Imj/Xf)} = by the intermediate-value theorem. This is contradictory to the 
assumption (al). Thus the first condition (18) holds. 
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The third condition (20) can be shown in the same manner as (18), if we find some 
£o £ S>7 such that Im{/(£o)} < 0. By assumption it holds that 

Im{/(^ )}+Im{/(^)} = lm[f{^)+f{Q] = M/tfo) +{/(&)}*] = Im[2Re{/(^ )}] = 0. 

Thus lm{f(Q)} < 0, which completes the proof. ■ 

Let us define functions G\ and Gt as 



/ : Vi+>7 2 



We can check that G\ satisfies the assumptions of Lemma 11 and the assertion (a). 
Hence we have the next lemma from the assertion (b). 

Lemma 12. The equality sgn[Im{Gi(^)}] = sgn[Im7/] holds for all rj e Q)\. 

In fact it holds for all rj e C, but r\ e $)\ is sufficient for our purpose. 

Lemma 11 can not be applied directly to the function G2 since G2(±i) = ±00, i.e. 
G2 is not continuous at rj — ±i . However, G2 still satisfies Im{G2(+i)} = +°° > and 
Im{G2(-i )} = -°o < 0. Therefore sgn[Im{G2(77)}] = sgn[Im{?7}] holds even if r\ — ±i . 
Then we have the next lemma. 

Lemma 13. The equality sgn[Im{G2(?7)}] = sgn[Im7/] holds for all 77 e S)\. 

Using the two lemmas above, we prove Lemma 9. 

Proof. In the same argument as in Lemma 2, we consider the function cos(Im£(f)), 
where g(f) = {i/'™} _1 (z), and show it is a monotonically increasing function. Since 



j ( cos(lm(£(0)) = -sin(Imf(r))^{Im£(r)}, 



we examine the signs of sin(Im£(f)) and {Im£}' below. Let us define a function 77(f) as 
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/7(f) = I log (fEf)- Then ((f) = log^f) + y/l+rftf)}, and 

Im [77(f) + yjl + T] 2 (t)] 

Ut) + vi + ^ 2 (o| 



sin(Im£(f)) 



because sin(arg(£)) = Im£/|£| for all £ e C. We set x, = Re 77(f) and y, = Im 77(f) here. 
Note 77(f) e $)\ by definition. According to Lemma 12, it follows for all 77(f) e 2>\ that 



sgn < Im 



77(f) + ^1 + 77 2 (f) 



sgn{Im[G 1 (77(f))]} = sgn{y r }. (21) 



Next we examine the sign of {Im£ }'. The function Im^(f) can be written as 



Imf(f) = 



2i 



los 



g 1 77(f) + ^1 + iftol - log \rf(f) + Jl + {rf(t)\ 



By differentiating and rewriting this equation, we obtain 



d lm\(t-z)^l+i 1 2 (t)\ 

- Imf(f) = — t i 

df 



\(t-z)y/l+i] 2 (t)\ 



From the definition of 77(f), we have (t - z) - (t - a) /(I + e™ 7 '®), and then 



-lm^) = 



1 + e™* 



V(f-a){l+/7 2 (0) 



Im 



Vi + iiHt) 



l + e mi(t) 



Thus by applying Lemma 13, it follows for all 77(f) e &\ that 



sgn < Im 



Vj + r/ 2 (0 

l + e Jt?7(f) 



= sgn {Im [G 2 (-77(f))]} = - sgn {y t } 



(22) 
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Finally, using the expressions (21) and (22), we obtain the desired conclusion: 



) d n fruxvX {sgn(y,)} 2 
sgn \ — cos(Im(£(f))) \ = j , sgn < 

> sgn ||t7(0 + VI + ^7 2 (0| 



I + e Jt>K«) 



V(f-a){i + 7/ 2 (f)} 



2^ 



>0. 
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Boundary type quadrature formulas over axially 
symmetric regions 
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Abstract 

A boundary type quadrature formula (BTQF) is an approximate integration 
formula with all its of evaluation points lying on the boundary of the integra- 
tion domain. This type formulas are particularly useful for the cases when 
the values of the integrand functions and their derivatives inside the domain 
are not given or are not easily determined. In this paper, we will establish 
the BTQFs over some axially symmetric regions. We will discuss the follow- 
ing three questions in the construction of BTQFs: (i) What is the highest 
possible degree of algebraic precision of the BTQF if it exists? (ii) What is 
the fewest number of the evaluation points needed to construct a BTQF with 
the highest possible degree of algebraic precision? (iii) How to construct the 
BTQF with the fewest evaluation points and the highest possible degree of 
algebraic precision? 

1 Introduction 

Although numerical multivariate integration is an old subject, it has never been 
applied as widely as it is now. We can find its applications everywhere in math, 
science, and economics. A good example might be the collateralized mortgage 
obligation (CMO), which can be formulated as a multivariate integral over the 180- 
dimensional unit cube ([2]). A boundary quadrature formula is an approximate 
integration formula with all its evaluation points lying on the boundary of the 
domain of integration. Such a formula may be particularly useful for the cases 
when the values of the integrand function and its derivatives inside the domain are 
not given or are not easily determined. 

Indeed, boundary quadrature formulas are not really new. From the viewpoint 
of numerical analysis, the classical Eulcr-Maclaurin summation formula and the 
Hcrmite two-end multiple nodes quadrature formulas may be regarded as one- 
dimensional boundary quadrature formulas since they make use of only the inte- 
grand function values and their derivatives at the limits of integration. The earliest 
example of a boundary quadrature formula with some algebraic precision for mul- 
tivariate integration is possibly the formula of algebraic precision (or degree) 5 for 
a triple integral over a cube given by Sadowsky [30] in 1940. He used 42 points 
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on the surface of a cube to construct the quadrature, which has been modified by 
the author with a quadrature of 32 points, the fewest possible boundary points 
(see [9] and [10]). Some 20 years later after Sadowsky's work, Levin [26] and [27], 
Federenko [6], and Ionescu [21] investigated individually certain optimal boundary 
quadrature formulas for double integration over a square using partial derivatives 
at some boundary points of the region. Despite these advances, however, both the 
general principle and the general technique for construction remained lacking for 
many years. 

During 1978-87, based on the ideas of the dimension-reducing expansions (DRE) 
of multivariate integration shown in Hsu 1962 and 1963, Hsu, Wang, Zhou, Yang, 
and the author developed a general process for the construction of BTQFs in [17]- 
[20] and [9]-[15]. 

The analytic approach for constructing BTQFs is based on the dimension- 
reducing expansions (DRE), which reduces a higher dimensional integral to lower 
dimensional integrals with or without a remainder. Hence, a type of boundary 
quadratures can be constructed by using the expansions. 

The DRE without remainder is also called an exact DRE. Obviously, a DRE can 
be used to reduce the computation load of many very high dimensional numerical 
integration's, such as the CMO problem mentioned above. Most DRE's are based 
on Green's Theorem in real or complex field. In 1963, using the theorem, Hsu [17] 
devised a way to construct a DRE with algebraic precision (degree of accuracy) 
for multivariate integrations. From 1978 to 1986, Hsu, Zhou, and the author (see 
[18], [19], [20], and [?]) developed a more general method to construct a DRE with 
algebraic precision and estimate its remainder. In 1972, with the aid of Green's 
Theorem and the Schwarz function, P.J. Davis [4] gave an exact DRE for a double 
integral over a complex field. In 1979, also by using Green's Theorem, Kratz [24] 
constructed an exact DRE for a function that satisfied a type of partial differen- 
tial equations. Lastly, if we want this introduction to be complete, we must not 
overlook Burrows' DRE for measurable functions. His DRE can reduce a multi- 
variate integration into an one dimensional integral. Some important applications 
of DRE include the construction of BTQFs and asymptotic formulas for oscillatory 
integrals, for instance, the integrals on spheres, S d = {x <G R d : |x| = 1} and balls, 
B d = {x e R d : \x\ < 1}, presented by Kalnins, Miller, Jr., and Tratnik [22], 
Lebedev and Skorokhodov [25], Mhaskar, Narcowich, and Ward [28], Xu [35], etc. 

In this paper, we will discuss the algebraic approach to constructing BTQFs for 
a multiple integral over a bounded closed region f2 in K" , which is of the form 

w(X)f(X)dX. 

n 

In this expression, w(X) and f(X) are continuous on f2, and w(X) is the weight 
function. (w(X) can be 1 particularly.) We are seeking the BTQF of the integral 
with the form 

f W (x)f(x)dx « Yl j2 a ? 1 ''"' mnDmi, '" ,mn f( x *')> w 

n CKmH hm„<ra iel 

where dX is the volume measure; a™ 1 ' ' m ™ (i € I and < rni + • • • + m n < m) are 
real or complex quadrature coefficients; D Tni ''"' mn = <9 miH ^ m ^ j dx" 11 ■■■x™ n ; 
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and Xi = (xi t i,Xi2i • • • ,%i,n) {i G I) are evaluation points (or nodes) of / on dfl, 
the boundary of f2. In particular, when m = we write a™ 1 '"' m " = a * ano - formula 
(1) can be rewritten as 



[ w(X)f(X)dX n Vdi/fli). 
•to ie/ 



(2) 



(2) is called a BTQF without derivative terms. When m ^ 0, (1) is called a BTQF 
with derivative terms. The corresponding error functionals of approximations (1) 
and (2) are defined respectively by 



E(f) = E(f;Cl) = [ w(X)f(X)dX 



and 



J2 J2 a T 1 ''"' mnDmi ''"' mn fi X i) (3) 

0<rniH \-m n <7n i£l 

E(f) = E(f; Q) = [ w(X)f(X)dX - V a t f{X^. (4) 

Suppose that <9f2 can be described by a system of parametric equations. In 
particular, the points X = (xi, • • • , £«) on c?Sl satisfy the equation 

*(*) = 0, (5) 

where <& has continuous partial derivatives. In addition, <&(X) < for all points in 

n. 

Let S be another region in R™, and let J : Y = JX, X G SI, be a transform 
from SI to S with positive Jacobian 



\J\ 



d(Y) 



d(X) 



>o, 



X G CI. J is one-to-one and has the inverse J 1 : X = J 1 Y, Y € S. Denote 
W\{Y) — wi(JX) — w(X). Then for any continuous function g(X) 

Wl {Y)g{Y)dY= f Wl (Y)g(Y)\J\dX. 

Denoting Y t = JX t (» G J), | J«| = \J\ x=Xi , and taking /(X) - \J\g(Y) = | J|«/(JJf) 
in equation (4), we obtain 



E(\J\g;Cl) = [ w{X)\J\g{Y)dX-Y^a l \J l \g{Y l )= f Wl (Y) 3 (Y)dY - V 6, 



s(X), 



where bi — di\Ji\ (i G /). Obviously, if Y, the boundary points of S, satisfy 
<I> 1 (Y) = $i(JX) — &(X) = 0, then J maps the boundary evaluation points Xi 
(i G I) on f2 onto the boundary evaluation points Y = •/-X'i on S. Consequently, 
we have the following result. 
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Theorem 1 Let the error functional of the quadrature formula 

f Wl {Y)g{Y)dY ?*Y, b ^) ( 6 ) 

J s ie i 

be E(g;S) = j ?Wl (Y)g{Y)dY -£igiM*i)- Then E{g; S) = E{\ J\g;Q). In par- 
ticular, if \J\ is a constant, then E(g;S) — \J\E(g;fl). In this case, E(g;£l) = 
implies E(g; S) = 0. 

In addition, if the boundary of S is defined by &i(Y) = $i(JX) = Q(X) = 
and <&(X) = defines the boundary of 51, then quadrature formula (6) is also a 
BTQF. 

In this paper, we will establish the BTQFs over some axially symmetric regions 
or fully symmetric regions (see the definitions below). Theorem 1 tells us that we 
can construct the BTQFs over many more regions from the obtained BTQFs over 
the special regions by using certain transforms. In addition, if the transform is 
linear, then the new BTQF is of the same algebraic precision degree as the old 
BTQF. 



2 BTQFs without derivatives 

Three questions arise during the construction of BTQFs (1): 

(i) What is the highest possible degree of algebraic precision of the BTQF if it 
exists? 

(ii) What is the fewest number of the evaluation points needed to construct a 
BTQF with the highest possible degree of algebraic precision? 

(iii) How to construct the BTQF with the fewest evaluation points with the 
fewest evaluation points and the highest possible degree of algebraic precision? 

We now answer the first question. In most cases, BTQF (1) has an inherent 
highest degree of algebraic precision. For instance, if &(X) is a polynomial of degree 
to, then the highest possible degree of algebraic precision of the BTQF without 
derivative terms (i.e., formula (2)) cannot exceed to — 1 because the summation on 
the right-hand side of (2) becomes zero and the integral value on the left-hand side 
is negative when / — $. Hence, when the boundary function $ is a polynomial of 
a low degree, to raise the degrees of algebraic precision of the quadrature formulas, 
we must construct BTQFs with derivative terms (i.e., formula (1) with m ^ 0). 

In the following, we are going to find the solutions to questions (ii) and (iii). To 
simplify our discussion, we limit the region in question, 51, to be axially symmetric 
or fully symmetric. An axially symmetric region is a region that for any point 
X = (#i, • • • , x n ) in it, must contain all points with the form (±xi, • • • , ±x n ). The 
set of axially symmetric points associated with X forms a reflection group. If a 
region containing a point X = (xi, • • • , x n ) also contains all points (±ai, • • • , ±a n ), 
where (ai,--- ,a n ) is a permutation of (xi,--- ,x n ), then the region is called a 
fully symmetric region. Throughout, we will denote all fully symmetric points, 
(±oi, • • • , ±a„), associated with X by Xps and call X the generator of the fully 
symmetric point set. The cardinal number of the set of fully symmetric points 
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associated with a generator X e 1" is 2™(n!). Obviously, a fully symmetric region 
is an axially symmetric region, but the converse is not true. 

A quadrature formula is called a fully symmetric quadrature formula if the 
quadrature sum can be divided into several subsums such that in each of the sub- 
sums, the evaluation points are fully symmetric and the corresponding quadrature 
coefficients are the same. In addition, if the fully symmetric evaluation points are 
on the boundary of the integral region, then the corresponding quadrature formula 
is called a fully symmetric BTQF. 

Denote a monomial in terms of X by X a {a £ Zq), which can be written in the 
form X a = x" 1 , • • • , x^™ , where («i, • • • ,a n ) is called the exponent of X a . 

From the definition of the fully symmetric region, we immediately have the 
following results. 

Theorem 2 The value of a multiple integral of a monomial X a over an axially 
symmetric region is zero if a contains an odd component. The value of a multiple 
integral of X a over a fully symmetric region depends on a, but is independent of 
the order of a, (i = 1, • • • , n). 

Theorem 3 Denote by 7r"(X) the set of all polynomials of degree no greater than 
r. Let fl be a fully symmetric region, 

[ f{X)dX^Y^a i f{X i ) (7) 

Ja iei 

be a fully symmetric BTQF, and E : f — ► M be the error operator defined by 
E(f)=E(f;n)= ( f(X)dX-Y^a l f{X l ). 

(The above expressionis a special form of (4) withw(X) = I.) Themr^k+i C N(E), 
the null space of E, if and only if 

x\ kl ■ ■ ■ xf n £ N(E) < hi < ■ ■ ■ < k n , ki + ■ ■ ■ + k n < k. (8) 

Theorem 3 can be considered as the general principle for constructing fully 
symmetric BTQFs. First, we set one or more sets of fully symmetric evaluation 
points, with possibly some unknown points {ATi}, on the boundary <9f2 and assume 
the quadrature coefficients Oj corresponding to each set to be the same. Then 

substituting all f(X) = x\ kl ■ ■ -x 2 ^ (0 < fci < • • • < k„ and fci H h k„ < k) 

into E(f) — 0, we obtain a system about Xi and Oj. Finally, we solve the system 
for Xi and a, and a quadrature formula is constructed. However, a fully symmetric 
quadrature formula usually has too many evaluation points. (Remember that for 
a point X £ R" there are, in general, 2"(n!) fully symmetric points.) In order to 
reduce the number of evaluation points in the quadrature formula, we can use an 
alternative form of Theorem 3 to construct a different type of symmetric quadrature 
formulas. We will use the following example to illustrate the idea. 

Example 1. Consider a triple integral over the region C3 = 

[— 1, l] 3 . Obviously, the inherent highest degree of algebraic precision of the BTQF 
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is 5. To construct a fully symmetric BTQF, we make use of the following fully 
symmetric evaluation points. 



(1,0,0)^5, (1,1,0) F3 , and{l,x , x )fs, 

where Xq (0 < x < 1) is undetermined. The three sets of fully symmetric points 
contain a total of 42 points (6, 12, and 24 points for the first, second, and third 
set respectively). Let the respective quadrature coefficients for each set of fully 
symmetric points be L, M, and N, all of which can be found using the general 
principle for constructing fully symmetric BTQFs. Substitute / = 1, x 2 , x 4 , and 
x 2 y 2 into 

/ f(x, y, z)dxdydz = a x ^ fe + a 2 E $ 12 + ° 3 E -^ 24 ' 
J C3 

where ^2 /@, ^2 /12, and ^2 f 2 4 are the sums of the function values of / over the first, 
second, and third set of symmetric points, respectively. Solving the above system 
yields 

364 160 64 

225' &2 ~ ^225' ° 3 
giving the following BTQF of algebraic precision order 5 



''" l "' ai 225' ° 2 225' ° 3 225' 



J f(x, y, z)dxdydz « A [si E /« " 40 E ^ + 16 E -H ■ ( 9 ) 

Quadrature formula (9), given by Sadowsky [30], uses too many evaluation 
points. Carefully considering Theorem 3, we find that the principle of construct- 
ing fully symmetric BTQFs shown in the theorem can be used to construct some 
"partial" symmetric BTQFs with fewer evaluation points. 

A set of points Xi € R™ (i G I) is called a symmetric point set of deyree k if it 
possesses the following two properties. 

(a) ^2 ieI f(Xi) = for all f(X) = X a , where a contains an odd component. 

(b) J2 ie i f( X i) arc thc samc for a11 f( X ) = x l kl ■ ■ ■ x i k "> 2 ( fc i + --- + K) = r. 
Here, r < k. 

Obviously, a set of fully symmetric points must be a set of symmetric points of 
any degree, but the converse is not true. For instance, a symmetric point set of de- 
gree 5 may not be a fully symmetric point set. We now list all symmetric point sets of 
degree 5 on the boundary of C3 as follows. / = {(±1, ±xq, 0), (±#o, 0: ±1)7 (0, ±1, 
±s ),0 < x Q < 1}, II = {(±y ,±l,0),(±l,0,±yo),(0,±2/o,±l),0 < y Q < 1}, 
III = {(±1,±1,0),(±1,0,±1),(0,±1,±1)}, IV ={(±1,±1,±1)},V={(1,0,0) FS }, 
VI = {(1,x 1 ,x 2 )fs,0 < Xi,X2 < 1}, VII = {(1,1, £3)^,3,0 < x 3 < 1}, where sets 
V , VI, and VII are fully symmetric, but others are not. 

If a BTQF constructed by using symmetric point set of degree k satisfies condi- 
tion (8), then it is called a symmetric BTQF of deyree k. 

Example 2. As an example, we now use the sets /, III, and IV to construct a 
symmetric BTQF of degree 5 with 32 evaluation points over C3. Denote the quadra- 
ture coefficients corresponding to /, ///, and TV as a\, a 2 , and a 3 respectively. 
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Following the procedure shown in Example 1, we obtain a symmetric BTQF of 
degree 5 as follows 

f(x, y, z)dxdydz 
c 3 

^ 4[ 8 °^ /l2(/)_52 ^ /l2(///)+21 ^ /8(/y) ]' (10) 

J2 fi2(I), X) fi2(IH), and J2 fs(IV) are the sums of the function values of / over 
the symmetric point sets /, 77/, and IV, respectively; the numbers in the sub- 
indices are the cardinal numbers of the corresponding set. 

Similarly we can use sets //, III, and IV to construct another symmetric BTQF 
of degree 5. 

f(x,y, z)dxdydz 
c 3 

* 4[80^/i 2 (//)-52^/ 12 (m) + 21^.f 8 (/t/)], (11) 

where y a — J ^ in set II. 

Quadratures (10) and (11) can be considered as two special cases of the following 
symmetric BTQF of degree 5, which is constructed by using /, 77, and IV. 



9(so - Vt 



^ < y < l, yo + 



4(l + 2/o 2 ) 

J2fMn) + \j2f^ Iv ^ ( 12 ) 



/ f(x, y, z)dxdydz w — 2 _' ° ^ fM 1 ) 
>c 3 y \Vo x o) 

4(1 + 




where 



When yo — I and y = J ^ we obtain formulas (10) and (11), respectively. 

It can be proved that the minimum number of evaluation points of symmetric 
BTQFs is 32. Since the quadrature formula is symmetric, on each boundary plane 
we must have the same number of evaluation points. Let the number of evaluation 
points on each boundary plane be k — 2 (Obviously, k cannot be 1). The sym- 
metric point set has to be / or //. It is easy to check that the sets cannot yield a 
symmetric BTQF of degree 5. Similarly, for the cases of k = 3, • • • , 9, no matter 
which symmetric point sets are chosen from {7, • • • , VII}, we find that there does 
not exist any symmetric BTQFs of degree 5 with evaluation points less than 32. 
For k > 10, every symmetric BTQF of degree 5, if it exists, must have more than 
32 evaluation points. Thus, we obtain the following proposition. 

Proposition 4 There exist infinitely many symmetric BTQFs of degree 5 with 32 
evaluation points. In addition, the number of evaluation points of a symmetric 
BTQFs of degree 5 can not be less than 32. 
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For BTQFS of degree 3, the minimum number of the evaluation points is reduced 
to 6. As an example, we give the following formula. 



f(x,y,z)dxdydz * Uf (1,0,0) + f (-1,0,0) 
c 3 J 

+/(0, 1,0) + /(0, -1,0) + /(0,0,1) + /(0,0,-l)] . 

Example 3. We will use a double layered spherical shell as an example to 
demonstrate the techniques of regrouping evaluation points to obtain the symmetric 
BTQF with the fewest evaluation points. A double layered spherical shell in R n , 
denoted by Sh n , is defined by 

Sh„ = {X e K" : a 2 < \X\ < b 2 }. 

It is easy to find that the largest degree of algebraic precision of BTQFs over Sh n 
without derivatives is 3. We choose the following point sets as evaluation points: 
VIII = {(±6,0,- ■■ ,0),(0,±b,0,--- ,<)),■■■ ,(<),■■■ ,0,±b,0)}, IX = {(0, ■ ■ ■ ,0, 
±6)}, X = {(0,--- ,0,±a)}. 

Obviously, these sets are neither fully symmetric point sets nor symmetric point 
sets of degree 3, but by using these sets, we can construct a BTQF of degree 3 
over Sh n with the fewest evaluation points. Denote the quadrature coefficients 
corresponding to VIII, IX, and X by oi, &2, and 03, respectively. The BTQF 
generated, 

J I(X)dX^a 1 Y^h(n-i){VIII) + a 2 Y^h{IX)+a i Y,h(X), (13) 

is of algebraic precision of degree 3 if it holds exactly for / = 1, x\, and x^; i.e., 
coefficients en (i = 1, 2, 3) have to be 

01 = a(b 2 - a 2 ) (b n + 2 - a n + 2 ) 

a 2 = a (b n+i + (n+ l)a n+2 b 2 - 3b n+2 a 2 - (n - l)a n+i ) 

a 3 = ab 2 (2b n+2 - (n + 2)a n b 2 + na n+2 ) , 



where 



-Trn/2 



26 2 r(f + l)(n + 2)(6 2 -a 2 )' 



When n — 2 and 3, formula (13) gives BTQFs over a ring domain and a 3- 
dimensional double layered spherical shell respectively as follows. 

f(x,y)dxdy 

Sh 2 

^W^ {{b2 + ° 2)[/(6 ' 0) + /(_6 ' ° )] + 262[/( °' a) + /( °' _a)] 
+(b 2 -a 2 )lf(0,b) + f(0,-b)}} 



497 
Boundary type quadrature formulas 



f(x, y, z)dxdydz 

Sh 3 

2tt 



{(fo 2 -a 2 )(fo 5 -a 5 )[/( o ,0,0) + /(-o,0,0) + /(0,6,0) 



156 2 (6 2 -a 2 ) 

+/(0, -b, 0)] + b 2 (2b 5 - 5a 3 6 2 + 3a 5 ) [/(0, 0, a) + /(0, 0, -a)] 

+(fe 7 - 3a 2 o 5 + 4a 5 b 2 - 2a 7 ) [/(O, 0,6) + /(O, 0, -6)] } . 

Taking the limit a — > 0, from quadrature formula (13) we obtain the following 
quadrature formula over the sphere 5*3, which has the algebraic precision of degree 
3. 

W 2 6™ 

fix, y, z)dxdydz m -. r 

s 3 2(n+2)r(f + 1) 

£ h(n-i)(VIH) + ]T h{IX) + 4/(0, ■ ■ ■ , 0)' 

We now prove that BTQF (13) is a formula with the fewest evaluation points. 

Theorem 5 The minimum number of evaluation points of BTQFs over an n- 
dimensional double layered spherical shell Sh n is 2(n +1). In particular, the 
minimum number of evaluation points for BTQFs over a ring domain and a 3- 
dimensional double layered spherical shell are respectively 6 and 8. 

Proof. For a BTQF over Sh n with precision degree 3, we will first prove that the 
minimum number of evaluation points on the outside layer of Sh n cannot be less 
than In. Without a loss of generality, we assume that the number of evaluation 
points on the outside layer is In — 1. (The cases when the minimums are less 
than In — 1 can be proved similarly.) We will see that a contradiction from this 
assumption. If the assumption is valid, we take the limit a —> to the BTQF and 
obtain a quadrature formula over an n-dimensional sphere with In — 1 evaluation 
points as follows. 

2n-l 

f(X)dX « ao/(0, • • • , 0) + V cufiXi), (14) 

s- i=l 

where Xi (i — 1, • • • , In — 1) lie on the sphere surface and a, ^ (i = 1, • • • ,n). 
We will prove it cannot be of algebraic precision degree 3. 
Let us consider the following In complex vectors 

AX ir -- ,AX n ,Axj,AXl--- ,AXl (15) 

where 

AXi = (y/0lXl,i, \fa~ix2S, • • • , v/°2n-l»2n-l,i) 

and 

-2 _ / r— 2 n^J2 i- 2 



AX i — (y/OlX lyi , ^/w2X 2i , • • • , \fO,2n-\X2n-l,i)- 

Assume that there exist constants bi (i = 1, • • • , In) such that 

hAX-i + ■■■ + b n AX n 

+b n+l Ax\ + b n+2 AX 2 + ■■■ + b 2n AXl = 0. (16) 
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Taking dot product with AX, (i = 1, ■ ■ ■ , n) on both sides of (16) and noting that 
the quadrature sums in (14) are vanishing for all / = X a if a has an odd component 
and |a| < 3, we obtain 



biAXi ■ AX, = h^2 a ^ = °> i = !i 



, n. 



Since the sums in the above equation are the quadrature sums of BTQF (14) for 
f(X) = X a with a — 2ej ({ei,e2, • • • , e n } being the standard basis of W 1 ), which 
are not be zero, we obtain 6^ = for all i = 1, • • • , n. Consequently, equation (16) 
is reduced to 

b n+1 Ax\ + b n+2 AX 2 2 +■■■ + b 2n AX 2 n = 0. (17) 

Taking the dot product with A = (^/al, • • • , A /a 2 „-i) on both sides of equation (17) 
and noting that the quadrature sums in (14) are vanishing for all / = X a if a = 3, 
we obtain 



2 , \bn+, i-v;||, 



AXJI? =0. 



Hence, 

Vl^AXi + • • • + V^AY,, = 0. 

Similarly, we have & n +i = for all n = 1, • • • ,n. Thus, vectors (15) are linearly 
independent, but this is impossible because all of them have In — 1 components. 
This contradiction means that the number of evaluation points on the outside layer 
for any BTQFs over Sh n with algebraic precision degree 3 must be more than In— 1. 
We now prove that the number of evaluation points on the inside layer for any 
BTQFs over Sh n with precision degree 3 cannot be less than 2. Otherwise, if there 
is none or there is only one evaluation point, Xq = (#o,i,a;o,2, ' ' ' > x o,n)i on the 
inside layer of Sh n , then a BTQF over Sh n with algebraic precision degree 3 is not 
exact for quadratic polynomial f(X) = X)"=i x 1 — ^ or ^ or a curj ic polynomial 

where xqj ^ 0. This completes the proof of theorem. □ 

A similar argument of the proof of Theorem 5 can be applied to solve other 
minimum evaluation point problem. For instance, we have the following result. 

Theorem 6 The minimum number of the evaluation points needed for constructing 
a quadrature formula over an axially symmetric region in R" with algebraic precision 
degree 3 is In. 

The construction of a quadrature formula of this type can be found in Section 
3.9 of Stroud [33]. 

The minimum number of the evaluation points needed for constructing a quadra- 
ture formula over an axially symmetric region in R™ with certain algebraic precision 
degree is topologically invariant under a reflection group action. 
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3 BTQFs with derivatives 

To improve the algebraic precision degrees of BTQR's, we use the derivatives of the 
integrands. As examples, we will construct symmetric quadrature formulas over the 
surfaces of the regions C 2 = [— 1, l] 2 , C3 = [— 1, l] 3 , and the n-dimensional sphere 

Example 4. Denote the sets of fully symmetric points XI = {(1, 1)fs} and 
XII = {(l,0)ps}. We construct a symmetric BTQF with precision degree 5 over 

C2 = [— 1, 1] as follows. 

^ f(x, y)dxdy » 01 ^ h( XI ) + «2 Y, M XH ) 

C2 

+a 3 [f' x (l, 1) - f'A-h -1) + £(1, -1) - /i(-l, 1) 

+/;(i, i) - /;(-i, -i) + /i(-i, i) - /ja, -i)] 
+04[/i(i, o) - /i(-i,o) + /;(o, 1) - /;(o, -l)]. 

Obviously, the above quadrature formula is of precision degree 5 if it is exact for 
f(x,y) — 1, x 2 , x 4 , and x 2 y 2 . Therefore, we obtain 

1 16 2 2 

ai "~15' a2 -15' a3 -45' a4 = _ 9' 
We use the following numerical example to show the good accuracy of the above 
BTQF. Considering function f(x,y) = e~ x ~ v and applying the last quadrature 
to the integral of f(x, y) over [0, 2] 2 , we obtain 

f(x,y)dxdy = \ f e-^+^+^+^^dxdy 
[0,2] 2 4 Jc 2 

« -- (e- 2 + 2c- 1 + 1) + — (V 5 / 4 + 2c- 1 / 4 
60 v ; 15 V 

-l(2 e - 2 + 2 e - 1 ) + V 5 /4 = 05576i 

while the actual integral value is 0.5577. 

Similarly, we can construct a BTQF over C3 = [— 1,1] 3 with precision de- 
gree 7 and 50 fully symmetric evaluation points XIII = {(1, 1, 1)fs}, XIV = 
{(1,0, 0) FS }, XV = {(1, |,0) FS }, and XW - {(1, 1, 0) FS } as follows. 

^ /(*, y, z)dxdydz « 01 ^ /8(A//I) + a 2 ]T / 6 (X/V) 
C3 



-tt3 



^ / 24 (Xy) + a 4 X] /i2(-X"V7) + asMx + a 6 M 2 + a 7 M 3 , 



,,.),„„ „ _ 1 „ _ 16 _ _ 512 - _ 64 „ _ 11 „ _ 16 _ _ 172 

wnere 01 - g , a 2 - - m > a 3 - 945' ° 4 ~ ~Ts5' a s ~ ~405> a e - -gi' ° 7 - 2835' 
Mx = £(1, 1,1) -£(-1,-1, -1) + £(1, 1,-1) -£(-1,-1,1) 

+£(1, -1,-1) - £(-1, 1, 1) + /i(l, -1, 1) - £(-1, 1, -1) 
+/;(-i, 1, -l) - /;(i, -1, 1) + /;(-i, 1, i) - f y {\, -1, -i) 
+/;(i, 1, -l) - /;(-i, -1, 1) + f y {i, 1, 1) - /;(-i, -1, -l) 
+f z (-l, 1, 1) - £(1, -1, -1) + £(1, -1, 1) - f' z (-l, 1, -1) 
+f z (l, 1, 1) - £(-1, -1, -1) + £(-1, -1, 1) - £(1, 1, -1), 
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M 2 = /;(l,0,0)-/^(-l,0,0) + ^(0,l,0)-/;(0,-l,0)+^(0,0,l)-^(0,0,-l), 

and 

M 3 = £(1,1,0) -£(-1,-1,0) + £(1,-1,0) -£(-1,1,0) 
+£(1,0, 1) - £(-1, 0, -1) + £(1,0, -1) - £(-1,0, 1) 

+ fy(0, 1, 1) - /i(0, "I, -1) + £(0, 1, -1) - £(0, "I, 1) 

+£(i,i,o) - £(-i,-i,o) + £(-i,i,o) - £(i,-i,o) 

+£(1,0,1) -£(-1,0,-1) + £(-1,0,1) -£(1,0,-1) 
+£(0, 1, 1) - £(0, -1, -1) + £(0, -1, 1) - £(0, 1, -1). 

Example 5. Choose 2n fully symmetric evaluation points XVII = {(r, 0, • • • , 0)fs}- 
We can obtain a BTQF over S n (Y^i—i x 2 < r 2 ) with the precision degree 3 as fol- 
lows. 



7 -n/2 n+l 

f(X)dX « — i—. , 

s n 2n(n + 2)r(f + 1) 



2 ^J 2n {XVII) 



-£ i (r,0,---,0) + £ 1 (-r,0,---,0) £ n (0,- • • ,0,r) + £ n (0, • • • ,0,-r) 



At the end of this section, we discuss the construction of the numerical quadra- 
ture formulas over S n = {X e R™|X| = 1} using some recent results in [35], 
where S n is the surface of the unit sphere B n = B n (\) = {X <G R"|X| < 1} in 
R™. Let H be a function defined on W 1 that is symmetric with respect to x n ; i.e., 
H(X, x n ) = H(X, —x n ), X e R™ _1 . Then for any continuous function / defined on 

f{Y)H{Y)dfi n 

f [f (x, V^W) + / (x, Ji^Wf) 

xH(X,^l-\X\ 2 ) - rfX =, (18) 



where y G S n , X E R™ 1 , — 1 < £ < 1, and du) n is the surface measure on S n . 
The volume of S n is w n = J § d\x n = 27r"/ 2 /T (f ). Formula (18), shown in Xu [35], 

can be proved straightforwardly by substituting d/i n = (1 — t 2 )( n ~ 3 ^ 2 dtdn n -i and 
y = (\/l — t 2 X, i) into the left-hand integral of the equation. 

(18) changes a boundary integral into an integral over the interior of the bound- 
ary. Hence it can be used to derive a BTQF over B n from a quadrature formula of 
an integral over B n -\. Following [35], suppose that there is a quadrature formula 
of precision degree m on B„_i 

r fix N 

/ g(X)H (x, ^W?) 7= ^^ « E °tf (*0; 

that is, the quadrature formula is exact for all polynomials in 7tJ5j _1 , which denotes 
the set of all polynomials defined in M™ -1 with a total degree not more than m. 
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Then there is a quadrature formula of homogeneous precision degree mon S n : 
f(Y)H(Y)dfi n 



N 



Y, Oi [/ (*i, V 1 - l^l 2 ) + / (Xi, -y/l-\Xi\ 



(19) 



2=1 



Recently, Mhaskar, Narcowich, and Ward (see [28]) developed a new method 
for obtaining quadrature formulas on S n , which can be applied to the right-hand 
integrals of equation (20) in the following theorem, so that the BTQFs over B n can 
be constructed. 

Theorem 7 Suppose that F(X) is a continuous function defined on the sphere 
B n (x\ + ■ ■ ■ + x\ < 1) that has 2m order continuous partial derivative with respect 
to x n . Then there exists the following expansion that has m terms and possesses 
2m — 1 of algebraic precision. 



JB n k=Q Jb n -x 



k (F(X),U m (X))dS + Pri 



(20) 



where Lfc(-, •) is defined by 



L k (F,G) 



(d k F\ fd m - k - l G\ fdx. 



\ O^n / \ (JX n 



l-k-1 



dv 



and p m has estimate 



\Pm\ < 



r(m+f + l)(2m)! 



Q2mp 



dxl n 



or 



\Pm\ < 



r(m+f + l)(2m)! 



Q2mp 



dx 2 " 



(21) 



(22) 



Formula (20) can be proved using the Green's formula successively, and it is 
omitted here. 
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Theoretical Analysis and Numerical 
Realization of Bioluminescence Tomography 

Rongfang Gong 1 , Xiaoliang Cheng 2 and Weimin Han 3 

Abstract. Mathematically, bioluminescence tomography (BLT) is an ill-posed inverse 
source problem. In this paper, a new formulation for the BLT problem is developed. It 
is used to explain rigorously the reason behind the loss of the continuous dependence of 
the light source function solution on the measurements. On the basis of the formulation, 
an approximation using finite element method is provided. Some error estimates for the 
reconstructed light source function are obtained. By using adjoint equations, a simple 
but efficient iterative scheme is explored. Several numerical examples are presented to 
test the performance of the formulation and the iterative scheme. 

Keywords. Bioluminescence tomography, ill-posed problem, finite element method 

1 Introduction 

The function of molecular imaging is to help study biological processes in vivo at the cellular 
and molecular levels, see e.g. [8, 25, 29, 31, 32]. It may non-invasively differentiate normal from 
diseased conditions. While some classic techniques do reveal information on micro-structures 
of the tissues, only recently have molecular probes been developed along with associated 
imaging technologies that are sensitive and specific for detecting molecular targets in animals 
and humans. A molecular probe has a high affinity for attaching itself to a target molecule and 
a tagging ability with a marker molecule that can be tracked outside a living body. Molecular 
imaging contains a lot of modalities, sec [7] for detail. Among them, optical imaging, especially 
fluorescence and bioluminescence imaging, has attracted remarkable attention for its unique 
advantages regarding performance and cost-effectiveness. 

Bioluminescence tomography (BLT) [12, 33, 34] is an emerging and promising biolumi- 
nescence imaging modality. The major issue of BLT is the determination of the distribution 
of in vivo bioluminescent source. In BLT, we reconstruct an internal bioluminescent source 
from the measured bioluminescent signal on the external surface of a small animal. The prob- 
lem of determining the photon density on the small animal surface from the bioluminescent 
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source distribution within the animal requires accurate representation of photon transport in 
biological tissue. Photon propagation in biological tissue is governed by the radiative transfer 
equation (RTE) [24] . However, the RTE is highly dimensional and presents a serious challenge 
for its accurate numerical simulations given the current level of development in computer soft- 
ware and hardware. Because the mean-free path of the photon is between 500 nm and 1000 nm 
in biological tissues, which is very small compared to the size of a typical object in this con- 
text, the predominant phenomenon is scattering. Usually a diffusion approximation of the 
RTE is employed [1, 30]. 

Based on the diffusion approximation equation, many theoretical analysis and numerical 
methods are explored, see e.g. [6, 7, 10, 18, 27]. To improve the accuracy of the reconstructed 
light source function, multispectral systems are developed [4, 11, 19, 21, 28]. We refer to 
[20, 35] for the BLT problem related to the optical properties issue. A recent survey of 
biomedical background, mathematical theory and numerical approximation for BLT is [22]. 

In this paper, we study the BLT problem through a new perspective that leads to better 
convergence behavior for its numerical solution. 

In Section 2, a simplified version of the BLT problem is reduced to an operator equation. 
The operator is compact, which explains why the source function in the BLT problem does 
not depend continuously on measurements. 

In Section 3, based on the discussion in Section 2, a new Tikihonov-type regularized formu- 
lation is provided to compute the light source function. In the formulation, the measurements 
on the boundary are transferred to a knowledge in the problem domain, which makes the BLT 
problem more regular. The well-posedness of the new formulation, including the solution ex- 
istence, uniqueness and continuous dependence, is shown. The limiting behavior is discussed 
for the regularized solution when the regularization approaches zero. 

In Section 4, a finite element approximation for the BLT problem based on the new formu- 
lation is studied. Specifically, we use piecewise constants to approximate the source function 
and standard linear elements to approximate the state function. Error estimates are derived 
with improved convergence orders compared to those found in [16, 18]. 

In Subsection 5.1, an iterative scheme for the BLT reconstruction based on the given 
formulation is shown. By introducing adjoint equations, we avoid computing the inversion of 
an elliptic partial differential operator. Some detail on the implementation is also given. Two 
numerical examples are presented in Subsection 5.2 to show the numerical performance of the 
method discussed in this paper. 
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2 Ill-posedness of the BLT problem 

We first introduce the classical formulation of the diffusion based BLT problem. Let the 
biological medium occupies a non-empty, open and bounded set H C M. d , d < 3. The boundary 
L of the domain Q is assumed Lipschitz continuous. Denote by D = (3([x a + /-O) -1 with 
absorption coefficient /i a and reduced scattering coefficient fi' s , and denote by <9„ the outward 
normal differentiation operator. Then the classical formulation of the BLT problem based on 
the diffusion approximation is the following ([7, 18]). 

Problem 2.1 Given D > 0, fi a > 0, g\ and g 2 , suitably smooth, find a bioluminescent source 
p such that the solution u of the boundary-value problem 

- div(DVu) + fi a u = pxn hi Q, (2.1) 

Dd u u = g 2 on T (2.2) 

satisfies 

u = gi on T. (2.3) 

Here Q , known as the permissible region, is a measurable subset of ft, and xn is t ne 
characteristic function of Qq: Xn<)( x ) equals 1 for x G fio, and equals for x G £1/SIq. 

It is shown in [18] that the pointwise BLT Problem 2.1 may have infinitely many solutions 
or may have no solution, depending on the choice of the function set where we look for the 
source function. It is also mentioned there even when the solution existence and uniqueness 
issues could be settled, the source function solution does not depend continuously on the 
measurement. This section is devoted to a rigorous proof of this instability statement for the 
BLT problem in the simplified case where the permissible region is the entire domain and the 
admissible set for the source function is the entire L 2 (Q) space. 

We use standard notations for Sobolev spaces ([2]). Denote V = if 1 (Q), Vo = Hq(Q), 
Q = L 2 (Q ). Our basic assumptions on the data throughout the paper are: D e L°°(fi) and 
D > Dq for some constant D > 0, fi a G L 2 (£l) and \x a > \Xq for some constant \xq > 0, 
9l G # 1/2 (r), and g 2 G L 2 (T). We further let V gi = {v G V | v = g x on T}. 

Define 

a(u,v)= / (DVm ■ Vf + fi a uv) dx Vm,vgV. (2.4) 



J a 
Then a(-, •) is symmetric, continuous and coercive on V. Therefore, by the Lax-Milgram 
Lemma ([2, 13]), for any q G Q, the problems 

u D (q, gi) £ V gi , a(u D (q,g 1 ),v) = (q,v) Q \fveV (2.5) 
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and 

«jv(g, 92) e V, a(u N (q,g 2 ),v) = (q,v) Q + (g 2 ,v) L 2 (r) \/vEV (2.6) 

each has a unique solution. Moreover, 

\\u D (q, 9i)\\v < c(\\q\\ Q + |bi|| ff i/2 ( r)), (2.7) 

\\uN(q,92)\\v <c(\\q\\ Q + \\g 2 \\i?(r))- (2.8) 

We write u D (q) = u D (q,0), u D (g{) = u D (0,gi), u N (q) = u N (q,0), and u N {g 2 ) = u N (0,g 2 ). 
Then u D (q, g t ) = u D (q) + Uofa) and u N {q, g 2 ) = u N (q) + u N (g 2 ). 

Next we introduce a weak formulation for Problem 2.1 based on both Dirichlet and Neu- 
mann boundary problems (2.5) and (2.6). Define 

s(p, q) = (u D (p) - u N (p), u D (q) - u N (q)) L 2 { ci) Vp, q E Q, 

l(q) = (u N (g 2 )-u D (g 1 ),u D (q)-u N (q)) L 2 { n) Vg G Q. (2.9) 

Apparently, the bilinear form s is symmetric and positive semi-definite over Q. Moreover, 
by Schwarz inequality, trace theorem [3, Theorem 1.6.6], together with the bounds (2.7) and 
(2.8), we conclude that both s and / are continuous on Q. 

Denote by S : Q — ► Q the operator through the relation (Sp,q)Q = s(p,q) for any 
p,q G Q. Then S is symmetric, positive semi-definite and continuous. Moreover, from Riesz 
representation theorem, there is an element denoted again by I G Q such that (/, q)Q = l(q) 
for any q G Q. 

We let Hq — ^ m the rest of this section, and introduce a new problem. 

Problem 2.2 Find p G Q such that 

Sp = l in Q. (2.10) 

We have the following result. 

Proposition 2.3 If p G Q solves Problem 2.1, then it is a solution of Problem 2.2. 

Proof. Let p* be a solution of Problem 2.1. Then, Urj{p*,gi) = UN(p*,g 2 ). So 

ud(p*) - u N (p*) = u N (g 2 ) - u D (gi). 
Thus, 
{u D (p*) -u N {p*),u D {q) -u N (q)) L 2 {n) = (u N (g 2 ) -u D (gi),u D (q) -u N (q))v(n) Vg G Q, 
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i.e., 

s(p*,q) = l(q) VgGQ. 

Hence, p* is a solution of Problem 2.2. I 

It is possible to demonstrate the equivalence between Problem 2.1 and Problem 2.2. For 
this purpose, we introduce the following stronger smoothness assumptions on the data: 

rec 1 ' 1 , DeC ' 1 ^), ^ g # 3/2 (r), (? 2 g # 1/2 (r). (2.11) 

We recall the following result ([18]). 

Lemma 2.4 Let Vt C M, d be an open bounded subset with a C 1 ' 1 boundary and u G H 2 {VL). 
Then there are infinitely many functions v G H 2 (Q) such that 

•yv = ju, ydyV = ydyU. 

Here 7 stands for the trace operator. 

We have the following converse of Proposition 2.3. 

Proposition 2.5 Assume (2.11). Then a solution of Problem 2.2 solves Problem 2.1. 

Proof. Let p* G Q be a solution of Problem 2.2: 

(Sp*,q) Q = (l,q) Q VqeQ 

or equivalent ly 

(u D (p*,gi)-u N (p*,g 2 ),u D (q)-u N (q))L2(a)=0 Vg G Q. (2.12) 

Define an operator £ from H 2 {Q) to L 2 {Q): £u = —div(D'Vu) + \x a u for u G H 2 (Q), and set 
u* = ud{p*,9i) — Un{p*t92)- Then u* G H 2 {Vt) from (4.10) and (4.11) below, and 5w» = 0. 
Denote g\^ = 7W* and g 2 ^ = 7-Dc^w*. Then, g 1:¥ G i/ 3 / 2 (r) and g 2 ,* G i^ 1//2 (r). By Lemma 
2.4, there exists a function ud,* 7^ w* in H 2 (Q) such that 

Wd,* = 0, Dd u un,* = #2,* on T. 

Let g* = ^u Di » G Q and «at,* = «k,* —u^E H 2 (£l). Then -u^,,* = u D (q*) and w* = «d,* — Un,*- 
From 

5 mat,* = £ ud,* — £ w* = g* — = g* in Q 

and 

Dd v UN,* = Dd u Uo,* — Dd u u* = g 2 ,* — Dd v ii* = on T, 
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we know that un,* = un(q*), i-e., there exists a g* G Q such that uo(q*) — un(q*) = u*. 
Substitute g* for g in (2.12) to give Ud(p*,9i) = Un(p*,92), which shows that p* is a solution 
of Problem 2.1. I 

We now present a result on the compactness of the operator S. 

Theorem 2.6 The operator S : Q ^ Q is compact. 

Proof. Let {p n } n C Q be bounded. Then there is a subsequence, denoted again by {p n }m 
which converges weakly in Q to some element p* G Q because of the reflexivity of space Q. 

Let u n D = u D (p n ), u n N = u N (p n ), i.e., u n D G V ,u% G V, and 

a(u n D ,v) = (p n ,v) Q VveV , (2.13) 

a(u n N ,v) = (p n ,v) Q VveV. (2.14) 

Then {«£>}« an d { M Af}n are bounded in V from the properties (2.7) and (2.8). Hence, we can 
extract two further subsequences, denoted again by {«£)}« an d { u %}n, which converge weakly 
in V and strongly in Q to u* D G Vq and w^y G V, respectively. Let n — > oo in (2.13) and (2.14) 
to get m^ = ud(p*) and -u^ = ujy(p*). Strong convergence of {w^jn to w^ in V follows from 

a IK -u*d\\v< «(«o - «d, u n D ~u* D )= / (p n - p*) (u n D - u* D ) dx^O 

as n — > oo. Similarly, -u^ — > -u^ as n — ► oo. 

Denote s n = <Sp n . Then {s n } n is bounded in Q. Repeating the above argument, we 
conclude that there exists an element s* G Q such that 

s n — ^ s* in Q, U£>(s n ) — »■ u D (s*), U]y(s n ) — > u N (s*) in V as n — > oo. 

Since 

(s n , g) Q = (<Sp n , g) Q = s(p n , g) = (u£ - u£, «o(g) - «;v(?))L 2 (n) Vg g Q, 

we have s* = Sp* by letting n — »■ oo. Consequently, strong convergence of s n to s* in Q follows 
from 

\\s n -s*\\l = (Sp n -Sp*,s n -s*) Q 
= s(p n -p*,s n -s*) 

= {u n D -U* D - {u n N - U* N ),U D ( S n ) - U D (S*) - (u N ( S n ) - U N (S*))) L 2 { Q) 

< (IK - u* D \\v + IK - u* N \\ v ) (\\u D (s n ) - u D (s*)\\v + \\u N (s n ) - u N (s*)\\ v ) 

as n — ► oo, and the proof is complete. I 

Compactness of S explains the instability of solutions of the BLT problem with respect to 
the measurement data. 
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3 A new regularized reformulation for the BLT problem 

In this section, a Tikhonov-type regularization method is used to reconstruct numerically 
the source function of the BLT problem. We seek the source function in an admissible set 
Qad C Q, which is assumed to be bounded, closed and convex set. Usually, we take 

Qad = {q e Q \ q>0 a.e. in Q }. 

For any e > 0, define a bilinear form s e (-, •) over Q x Q: 

s £ (p,q) = s(p,q)+e(p,q) Q Vp,gGQ. (3.1) 

Observe that s £ is symmetric, continuous and coercive on Q x Q for each e > 0. 

Note that Problem 2.2 is equivalent to minimizing the functional \ s(q, q) — l(q) over the 
space Q. Thus, we introduce the following regularized reformulation of the BLT problem. 



Problem 3.1 Find p £ G Q a d such that 



where 



J £ (p) = min J £ (q), 
q&Qad 



Je{q) = -s £ (q,q)-l(q). 



1 ||~ / ^ ~ / ^ ,,0 . £ ,, ,|2 



Easily, 

Je{q) = ^\\uD(q,9i)-u N (q,g 2 )\\l2 {n) -^\\u D (g 1 )-u N (g 2 )\\ 2 L 2 {n) +^\\q\\ 2 Q- 
We note that Problem 3.1 is equivalent to finding p £ G Q a d such that 

s £ (p £ ,q-p £ )>l(q-p £ ) VgGQ ad . (3.2) 

If Qad is a subspace of Q, (3.2) reduces to 

Se(pe, q) = l(q) Vq^Qad- 

Similar to S, we denote by S £ : Q — > Q the operator such that (S £ p, q)Q = s £ (p, q) for any 
p,q G Q. Then S £ = S + el is symmetric, continuous and coercive for each e > 0, where X 
stands for the identity operator over Q. 

Next we discuss the well-posedness of Problem 3.1. Standard arguments ([2, 26]) lead to 
the next result. 
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Proposition 3.2 Problem 3.1 admits a unique solution p £ in Q. 

Regarding solution stability of Problem 3.1, we have the following result. 

Theorem 3.3 The solution p £ of Problem 3.1 depends continuously on e > 0, D G L°°(Q), 
H a G L°°(Q), 9l G H l l 2 {T) and g 2 G L 2 (F). 

To prove Theorem 3.3, we need some preparations. Let o(-, •) be given in (2.4). Assume 
D 5 G L°°(Q) and fi s a G L°°(Q) such that \\D 5 \\ L oa^ < 5 D and 1 1 A** 1 1 x,°° («) < $ Ho for a small 
positive constant 5 to be specified below. For any q G Q, let u s D (q, g\) G V gi and u s N (q, g 2 ) G V 
be the unique solutions of the problems 

a 5 (u,v) = (q,v) Q VveV , (3.3) 

a 5 (u,v) = (q,v) Q + (g 2 ,v) L 2 {r) VvEV, (3.4) 

respectively, where 

a 5 (u, v)= I \(D + D 5 ) VuVv + {fi a + rf a ) u v) dx (3.5) 

Jn 

and S < 1. Again, we rewrite u 5 D (q), u 5 D (gi), u 5 N (q) and u s N (g 2 ) for u 5 D (q, 0), u 5 D (0, gi), u 5 N (q, 0) 
and ^^(0,^2) respectively. Then we have the following estimates. 

Lemma 3.4 For a properly small 5 > and any q G Q, there exists a constant c such that 

\\ u d((1) - u D(q)\\v < c8\\q\\ Q , 
\\u 5 N (q) - u N (q)\\ v < c6\\q\\ Q , 
\\u S D (gi) - u D {gi)\\ v < c5||5fi|| H i/2 (r) , 
II^U2) - u N (g 2 )\\ v < c5\\g 2 \\ L 2 {T) . 

Proof. We prove the first estimate, the others can be verified similarly. We recall that 

a(u D (q),v) = (q,v) Q Vv G V Q . (3.6) 

Subtract (3.6) from (3.3) to get 

a(u s D (q)-u D (q),v) = - f (D s Vu s D (q)Vv + pt S a u S D (q) v) dx \/veV . 

Jn 

Because u s D (q) — u D (q) G V for any q G Q, we can take v for u 5 D (q) —Urj(q) in above equation. 
Then by using of the coercivity of bilinear form a(-, •) and Schwarz inequality, we obtain 

\\u S D (q)-u D (q)\\v<c6\\u s D (q)\\ v . (3.7) 
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For 5 < 1, from the regularity property (2.7), we have 

||«i?(g)||v < c||g||g 

which reduces (3.7) to 

\\ u d((1) ~ u D(q)\\v < c8\\q\\ Q . 

Therefore, we complete the proof. I 

Let e s ER.gfe H l / 2 (T) and g 5 2 G L 2 (T) be such that \e s \ <e5, \\g(\\ H ^(r) < 8\\9i\\hV*F) 
and ||^2lU 2 (r) < ^||<?2||.L 2 (r), and define bilinear form sf (•, •) and linear functional l s (-) by 

s 5 e (p, q) = (u S D (p) - u 5 N (p), u S D (q) - u S N (q)) L 2 (n) + (e + e 5 ) (p, q) Q Vp, q G Q, 
l 5 (q) = (u s N (g 2 + g 5 2 ) - u s D ( 9l + g[), u s D (q) ~ u s N (q)) L 2 {n) Vg G Q. 

Note that u s N (g 2 + g s 2 ) = u s N (g 2 ) + u s N (g s 2 ) and u s D (9i + 9x) = u 5 D { gi ) + u 5 D {g{). Denote by p s e 
the solution of Problem 3.1 with s £ and / replaced by sf and I s respectively. For < 8 < 1, s s £ 
is symmetric, bounded and coercive on Q, and I s is continuous on Q. Hence, for < 5 < 1, 
p 5 £ uniquely exists. Set As s (p, q) = s s £ (p,q) — s £ (p,q) and Al s (q) = l 5 (q) — /(g). Note that 
As s (p, q) is independent of e. We have the following bounds on As s and Al s . 

Lemma 3.5 There is a constant c > such that for any p,q G Q and < 8 < 1, 

\As 5 {p,q)\<c8\\p\\ Q \\q\\ Q , (3.8) 

\Al\q)\ < c5(\\ 9l \\ Hl/2{r) + |M|L 2 (r))IMlQ. (3.9) 

Proof. From definitions of s e and sf , for any p,q G Q, we have 

As S (p, q) = (u S D (p) - u D (p) - (u 5 N (p) - u N (p)), u S D (q) - u S N (q)) L 2 (n) 

+ (u D (p) ~ u N (p),u 5 D (q) - u D (q) - (u 5 N (q) - u N (q))) L 2 {n) + e 5 (p, q) Q , 
Al\q) = (u%(g 2 ) - u N (g 2 ) - (u 5 D (gi) - u D (gi)),u s D (q) - u s N (q)) L 2 {n) 
+ (u N (g 2 ) - u D (gi), u s D (q) - u D (q) - (u 5 N (q) - u N (q))) L 2 {n) 
+ (u s N (g S 2) - u s D (9 S i),u s D (q) - A(q))L> { ay 

Consequently, (3.8) and (3.9) follow immediately from Schwarz inequality, Lemma 3.4, and 
the regularity properties (2.7), (2.8). The proof is complete. ■ 

Proof of Theorem 3.3. Now we can prove the stability Theorem 3.3. We recall that p £ G Q a d 
and p s £ G Q a d are such that 

s £ (p £ ,q-p £ )>l(q-p £ ) VqeQad (3.10) 

and 

s 5 M,q-p 5 £ )>l 5 (q-p 5 £ ) WqeQ ad (3.11) 
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respectively. Replace q = p £ in (3.10) and q = p £ in (3.11) and add them to get 

sM - Pe j £ - Pe ) < -(4( P 5 £ - Pe j £ ) - s £ (p 5 £ - p £ , P 5 £ )) + i 5 (p 5 £ - Pe ) - i(p 5 £ - Pe ) 

= -As s (p s £ -p £ ,p 6 £ ) + Al 5 (p s £ -p £ ). 
From the coercivity of s £ for e > together with Lemma 3.5, we have 

a (z)\\p S s ~ PsWl < S £ (P 5 £ -P £ ,P S £ -P £ ) 

<\As s (p 5 £ -p £ ,p 5 £ )\ + \Al s (p 5 £ -p £ )\ 

< cS\\p s £ -p £ \\ Q (\\p £ \\ Q + |bi||^i/2 ( r) + IMIz^r)) 

< cS\\p 5 £ -p £ \\ Q (\\p s e - p e \\cj + \\Ps\\q + lbi||^V2(r) + \\92\\L^r)) 

or 

(a(s)-c5)\\p 5 £ -p £ \\ Q < cS(\\p £ \\ Q + \\gi\\ H i/2 {r) + ||^ 2 |Ua(r)) , (3.12) 

where a(e) is a positive coercivity constant for bilinear form s £ which may depend on e. For a 
small enough 5, there is a positive constant c(e) independent of 5 such that a(e) — c5 > c(e). 
Then (3.12) reduces to 

Ibf -pb\\q < c^ (Ibellq + |bi||ffV2(D + II^IU^cd) 

which shows the convergence p £ — > p £ in Q when 5^0. □ 

We now explore the limit behavior of the solution of Problem 3.1 as the regularization 
parameter e — > 0. Denote by Z the solution set of Problem 3.1 with e — 0. Then if it is 
non-empty, Z is a closed, convex subset of space Q. Denote by p* the unique element in Z 
with minimal Q norm, that is, 

\\p4q = inf \\p\\q. (3.13) 

We have the following convergence result; its proof is similar to that of [18, Proposition 3.5]. 

Theorem 3.6 Assume the solution set Z is nonempty. Then 

Pe — ¥ P* in Q, as £ — * 0. 



4 A finite element approximation 

In this section, we consider the problem of approximating the solution of Problem 3.1. Stan- 
dard finite element method (FEM) is applied to discretize this problem. 

We use constant finite element space for an approximation of the light source space Q. 
Specifically, let {T 0j h}h be a regular family of triangulations over domains £1 C fl with 
meshsize H > 0. For each triangulation Tq^h = {Kh}, define finite element space Q H = {T G 
Q | T\k h € Vo(K), V Kh € %,h}, and set Q^ d = Q a d H Q H . Here we use Vk for the space of 
all polynomials of degree < k. Then we can define the following discrete problem. 
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Problem 4.1 Find a function pf G Q^ d such that 



s £ (p?, q H -p?)>l(q H -p?) Vq H eQZ d . (4.1) 

Similar to the continuous case, we have the following well-posedness result for Problem 
4.1. 

Proposition 4.2 For each H > 0, Problem J^.l has a unique solution p^ which depends 
continuously on all data. 

As for error estimate of the finite element solution p^ of Problem 4.1, we first derive an 
abstract error estimate. 

Theorem 4.3 There exists a positive constant c which is independent of e > and H > 
such that 

e 1/2 \\p?-Ps\\ Q <c M (llqV-p^ + llS^-lllljV-p^ 2 ). (4.2) 

Proof. By definition, 

«e(pf ~Pe,p" -Pe) = \\u D (pf ~ Pe) ~ U N (pf ~ Pe)\\h { U) + £ \\pf ~ Pc\\q- ( 4 - 3 ) 

Adding (4.1) and (3.2) with q = p^ , we obtain 

< S £ (p £:P f - p £ ) + S e (pf, q H ~ pf ) - l(q H - Pe)- 

Use this inequality, 

«e(pf - PcP" ~ Pe) < S E (p", q H - Pe) - l(q H - Pe)- 

Thus, 

Ss(Pe ~Pe,Pe -Pe) 

< S E (pf ~ Pe, q H - Pe) + {S £ Pe -l,^ ~ Pe) Q 

< \\ u d(P? ~Pe) ~ U N (pf - p £ )\\ L 2 {n) \\u D (q H - p £ ) - U N (q H - p e ) \\ L 2 {n) 
+ e\\pf - Pe\\Q\\q H ~ Pe\\Q + \\SePe ~ l\\ci\\q H ~ Pe\\q- 

Combining this inequality with (4.3), we deduce that 

\\U D (P? ~ Pe) - U N (pf - p £ ) \\ 2 L 2 {n) + s\\Pe ~ Pefq 
< C 



\u D (q -Pe)~U N (q -Pe)\\L*(0)+ £ h ~ Pc\\q + \\$ePe ~ 1\\q\\Q ~P 

from which we conclude (4.2). 

A direct consequence from Theorem 4.3 is in the following. 
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Corollary 4.4 p^ — ► p £ in Q when H — »■ 0; moreover, if p £ G i? 1 (^o); we /lave 

^ 1/2 |bf-Pe||Q<^ 1/2 (4.4) 

wit/i c depending on |p £ |_ffi(n ) but independent of e > and H > 0. 

We note that the convergence order in (4.4) is not optimal and an improvement is possible. 
In fact, using the technique in [18, Lemma 4.7], we have 

Proposition 4.5 There is a constant c independent of H > such that 

eVHrfl - r) \\n < cH 1/2 \\U H n - n ll 1/2 

fc \\Pe Pe\\Q _i CS2 II 11 Fe FellQ ■ 

Consequently, if p £ G if^Qo); 

e l,2 \\p^-Ve\\ Q <cH\p £ \f lm . 

An examination of the definitions of s £ (p H , q) and /(g) shows that we need further to ap- 
proximate such terms like ujj(q, <?i), Ud(q), and ud{9i)- Continuous piecewise linear functions 
will be utilized for this purpose. Let {Th}h be a regular family of triangulations over domains 
lies' 1 with a meshsize h > 0. For each triangulation % L = {Kh}, define finite element spaces 
V h and Vq as follows. 

V h ± { v e C(J}) | v \ Kh e Vi, \/K h g %}, V h = V h n v . 

For simplicity, in the following discussion, we further assume (2.11). We will use the same 
symbol g\ G H 2 (Q) for its trace g\ G H 3 ^ 2 (T). Denote by HyhV for the piecewise linear 
interpolant of v G H 2 (£l) and let g\ = Hv h 9i £ V h '■ Moreover, we will use the symbol g\ + V h 
for the set 

{v G V h | v(ai) = ffi(tti) Vvertex ^ G tf fc n I\ V K h G T ft }. 

For each q <E Q, denote by u%(q, gf) G g\ + Vq 1 the unique solution of 

a(«,u) = (g,u) g VueVtf (4.5) 

and by u%{q,g 2 ) G V h the unique solution of 

a(u, v) = (q, v)q + (g 2 , v) L 2 {v) Vw G V h , (4.6) 

where o(-, •) is defined in (2.4). Similar to the continuous case, we use the symbols u%(q), 
u>d(9i)i M w(°) and u h N {g 2 ) for u%(q,0), u%(0,g%), u%(q,0) and u%(0,g 2 ), respectively. 
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Now we give a discrete counterpart of the bilinear form (3.1) and the linear function (2.9). 
Given e > 0, for any p, q G Q, 



s h £ (p, q) = s h (p, q)+e (p, q) Q = (u h D (p) - u h N (p), u h D (q) - u h N (q)) L 2 {Q) + e (p, q) Q , (4.7) 

2(a)- 



1%) = (u h N (92)-u h D (9i),u h D (q)-u h N (q))mn). (4.8) 



Then s^ is symmetric and coercive for each e > 0, and both s^ and l h are uniformly bounded 
with respect to h. 

We now introduce a full discretization for Problem 4.1. 

Problem 4.6 Find p^ G Q^ snc/i £/ja£ 

*?(?£*, Q H ~ P h e H ) > l h (q H ~ P h £ ' H ) Vq H e Q H ad . (4.9) 

The following result holds. 

Proposition 4.7 Problem 4-6 admits a unique solution p h e ,H in Q^ d , and the solution depends 
continuously on the data. 

The rest of this section is devoted to an error estimation for Problem 4.6. We first present 
some preliminary results. 

From [17, Theorem 2.4.2.5 and Proposition 2.5.2.3], under the assumptions (2.11), we have 
the regularity properties: 

\\u D (q,9i)\\m(n) <c(\\q\\ Q + ||ffi|| H s/a (r) ), (4.10) 

\\u N (q,g2)\\H\n) <c(\\q\\ Q + |Mltfi/2(r))- (4.11) 

Using these regularity properties together with Aubin-Nitche trick, we have linear finite ele- 
ment error estimates in the following (see [9, Theorem 3.2.5] for detail). 

Lemma 4.8 For any q G Q,gi G H 3 ^ 2 (T) and g 2 G H l l 2 (T), there exists a constant c inde- 
pendent of q, <7i,<?2 and h such that 



Wh(q) ~ U D (q)\\ L 2 {n ) < ch 2 \\q\\ Q , 
\u%(q) - U N (q)\\ L 2 {n) < ch 2 \\q\\ Q , 
\u D (gi) ~ «c(#i)IU 2 (n) < c/i 2 ||^i||^3/2 (r) , 
\un{92) -u h N {g 2 )\\L*{tt) < ch 2 \\g 2 \\ H i/2 {r) . 
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Let As h (p,q) = s h (p,q) — s(p,q) and Al h (q) = l h (q) — l(q) for p,q G Q. Then we have 
estimates for As h and Al h as follows. 



Lemma 4.9 For any p,q G Q, there is a constant c which is independent of p,q,gi,92 and 
h > such that 

\As h (p,q)\<ch 2 \\p\\ Q \\q\\ Q , (4.12) 

\Al h (q)\ < ch 2 (|MU 2( r) + IMU 2(r) )IMlQ- (4.13) 

Proof. We rewrite As h (p, q) in the following way: 

As h (p, q) = (u h D (p) - u%(p), u h D (q) - u%(q)) L 2 {n) - (u D (p) - u N (p),u D (q) ~ u N (q)) L 2 {n) 
= (ud(p) ~ u h N (p), u h D {q) - u D {q) - (u h N (q) - u N (q))) L 2 (n) 
+ ( u d(p) ~ u d(p) ~ (u%(p) - u N (p)),u D (q) - u N (q)) L 2 {n) . 

Then from Lemma 4.8, and together with regularity properties for u h D (p), Md(q'), u%(p) and 
u N(q), we obtain (4.12). 

Similarly, we decompose Al h as follows: 

Al h (q) = (u h N (g 2 ) - 4^), u h D (q) - u h N (q)) L 2 {n) - (u N (g 2 ) - «d(#i), u D (q) - u N (q)) L 2 {n) 
= (u h N (g 2 ) - u h M), u h D (q) - u D (q) - (u h N (q) - u N (q))) L 2 {n) 
+ {u h N (g 2 ) - u N (g 2 ) - (u h D (g^) - u D (g l )),u D (q) - u N (q)) L 2 {n) . 

From the regularity properties of u%(g 2 ),u h D (g^),U£j(q) and UN^q), by use of Lemma 4.8, we 
obtain (4.13). ■ 

We now present an error bound for the finite element solution from Problem 4.6. 

Theorem 4.10 There exists a constant c > 0, independent of h, such that 

e\\p h e ' H -pf \\q < ch 2 (\\p £ \\ Q + \\gi\\ H s/2 {r) + IMI^r))- (4.14) 

Proof. From (4.9) with q H = p^ , we have 

e\\p h £ > H -P?\\ 2 Q <s h M> H -P?,P h s> H -P?) 

= S h s(P^Pf ~ P k s' H ) ~ S k M' H ,P? - P k s H ) 

< s h M^-P h e H ) + 1\pY-P H s)- (4-15) 

Write 

l\p^ - pf ) = Al\p h £ ' H - pf ) + l(£" - pf ). 
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From (4.1) with q H = p^ ,H , and together with (4.15), we obtain 



e\\ P h £ > H ~ Pf III < S h M,Pf ~ P k s' H ) + U h (p h e H - Pf ) + *.(pf ,P k e' H - Pf ) 

= As h (pf ,pf -p^ H ) + A/ h (p*> H -pf). (4.16) 



By use of (4.12) and (4.13) for As h and AP, we obtain (4.14) from (4.16). ■ 

Combining Theorems 4.3 and 4.10, a full error estimate of finite element approximation is 
as follows. 



Corollary 4.11 Let p £ andp^' be the unique solutions of Problems 3.2 and 4-9 respectively. 



Ji,H 
•iicxi y <±.ii ucl jj £ ana f 

Then 



Pe —> Pe in Q as h, H — >• 0. 

Moreover, if p e G H l (Q ), then there exists a constant c > 0, depending on ||p £ ||iji(a ) but 
independent of h and H, such that 

z\\Pe' H - PsWq < cHe 1/2 + ch 2 (\\p £ \\ Q + ||^i||^3/2 (r) + ||^ 2 ||i/i/2(r))- 



h,H 

£ 



At last, we comment that when the solution set Z is nonempty, the convergence of p 
to p* follows from the triangle inequality 

\\Pe' H ~ P*h < \\Pe' H ~ Peh + \\Pe ~ P*h 

in conjunction with Theorem 3.6 and Corollary 4.11. 



5 Numerical simulation 

In this section, we present some numerical results based on our new formulation for the BLT 
problem. First, we introduce an iterative scheme for this formulation. Then we provide 
a detailed finite element discretization process of the iterative algorithm. Finally, we show 
numerical results from two examples. 

5.1 An iterative algorithm for the BLT problem 

Let S, I, S e , and ud(q,9i), ud(q), Mfl(ffi), Un(q,92), un(q) and Wat^) be given in Sections 2 
and 3. 

Define two operators Ad and .4.^ from Q to H 2 (Q) by 

A D q = u D (q), A N q = u N (q) WqeQ, 
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and view them as two operators from Q to L 2 (Q). Set A = Ad — An and denote by b = 
UN{g-2) — uo(gi) G L 2 (Vt). Then for any q £ Q, 

Aq-b = (Ad - A N )q-b = u D (q,gi) -u N (q,g 2 ). 

Denote by A* D and .4.^ the adjoint operators of Ad and An'- 

(A* D v,q) Q = (v,A D q)mn), (A* N v, q) Q = (v,A N q)mn) Vu G L 2 (H), g G Q. 

Then A* : L 2 (Q) — *■ Q is such that A* = A* D - A* N . Consequently, for any p,q £ Q, 

Se(p,q) = (Ap,Aq) L 2 {n) +e(p,q) Q = (A* Ap, q) Q + £ (p,q) Q = ((A* A + el) p,q) Q . 

Therefore, S e = A* A + si. Similarly, I = A* b comes from l(q) = (b, ^4g)L 2 (n) = (A* b,q)q. 

For any q £ Q, denote by u D N(q) = u D (q,gi) - u N (q,g 2 ), and by w D = w D (u D N(q)) G V 
and wn = wjv(wdat(9)) G V the solutions of the adjoint variational problems 

a(v,w D ) = (u D N,v) L 2 (n) \/v£V a (5.1) 

and 

a(v,w N ) = (u D n, v) L 2(n) VwGV, (5.2) 

respectively. Then w D (u DN (q))\n a = A* D (u D N(q)) and w N (u D N(q))\a = A* N (u D N(q))- Thus, 
A* (Aq-b) = (A* D - A* N )(u DN (q)) = (w D (uDN(q)) - w N (u D N(q)))\n - 

Let Pad be the projection operator from Q onto Q a d- Following [14, Chapter I, Remark 
3.3], we consider an iterative scheme for solving (3.2). 

Algorithm 5.1 1 Choose p° £ Q a d-> se t k — 0. 
2 For k — 0, 1, ... , with p k £ Q ad known, 

2.1 solve (2.5) and (2.6) to get u D = UD(p k ,gi) and u N = UN(p k ,92)', 

2.2 compute f k = u D —u N ; 

2.3 solve (5.1) and (5.2) with udn{<i) replaced by f k to obtain w D = WD{f k ) and 



w N = w N (f); 

2.4 compute w k = w D — w N ; 

2.5 p k+1 = W p (p k ) =p k - p(w k \n + ep k ); 

2.6 project p k+1 onto admissible set Q a d- P k+1 = P a dP k+1 - 
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Note that under our priori smoothness assumptions on the given data, w^ G H 2 (Q), 
w% G H 2 (Q), and thus w k G H 2 {Q). Since H 2 {Q) -> C(H), w% is well defined. From (3.7) 
in the proof of [14, Chapter I, Theorem 3.1], for the iterates to converge, we should select 
those p which guarantees the operator W p = W p P a d over space Q is a strictly contractive 
mapping. By the contractivity of the projection operator P a d, we only need to show W p is a 
strictly contractive mapping. From (S £ q,q)Q > £\\q\\n, we have, for any (71,(72 G Q, 

\\W p (qi) - W p {q 2 )\\ 2 Q = ||(7i - q 2 f Q - 2 p(S £ ( qi - q 2 ), qi - q 2 ) Q + p 2 \\S £ { Ql - q 2 )\\ 2 Q 

<(l-2pe + p 2 \\S £ \\ 2 )\\ qi -q 2 \\ 2 Q: (5.3) 



where 



P eQ,p^o HPIIq peQ,p^o q &Q,q^o \\P\\q\\Q\\q P eQ, P ^o q &Q,q^o \\P\\q\\Q\\q 
Thus, W p is a strict contraction mapping if < p < 2e/\\S £ \\ 2 with S £ = A*A + eX. Moreover, 



leu WSePh \(S e p,q) Q \ \s £ (p,q)\ 

\o £ \\ = SUp — 77— = SUp SUp -77— r. — = SUp SUp 



the contraction factor (1 — 2 pe + p 2 ||<S £ || 2 ) in (5.3) attains its minimum at p = s/\\S £ 



Next, we discuss a discrete analogue of Algorithm 5.1 for the discrete Problem 4.6. For 
convenience, we assume T h and 7^ are consistent, i.e., the triangulation Tqh is a restriction of 
the triangulation 7^ on Cl . Let {?i} i J 1 and {7^}^ be the elements in fl and Qq, respectively, 
where N t and Nq denote the number of elements in Q and Qo respectively. Any q H G Q H with 
g^ |tj = qk, 1 < A; < A , can be written as q H = Ylik=iQkXk, where Xk is the characteristic 
function of the element T ik . Set q = (91, g 2 , • • •, <7;vo)*; where (•)' stands for transposition of (■). 
As a result, we can define an isomorphism Jq : M> N ° — » Q H through 

q H = Jqq- 

Let n be the number of nodes of the triangulation 7^, and let ipi(x) G V h , 1 < i < n, 
be the node basis functions of the finite element space V h associated with grid nodes x,. 
Then, for the problems (4.5) and (4.6), the solutions u h D G g\ + Vq and u% G V h can be 
expanded by u h D = *YTi=\ u D,i L Pi an d w at = YH=i u N,i¥ii respectively, where ur>,% = U£)( x i) 
and u Nyi = u h N (xi). Let I = {1,2, • • -,n}, 7 6 = {i G / | x* G T}, J = {1,2,- • -,N }, 
Ij — {k G Iq I Xj is a vetex of element Tj fe }, j G J. Moreover, define 

A = (dji), ciji = / DVipiVipjdx, i,j G /, 
Jn 

M = (rriji), rriji = / p a ipiipjdx, i,j G /, 



n 



[0, k G JoVj, 

Z = (Zi, Z 2 , • • ", Z n )*, Zj = / 02 ¥>j d«, 

K = A + M. 
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In what follows, we use the same symbol for a finite element function and its vector represen- 
tation associated with the given finite element basis functions. Then u k D and u k N in Algorithm 
5.1 can be calculated by 

n 

Ku k D = Fp k , u% ti = g 1 (x i ),iel b , u k D = J2 u D,iVi 



i 

i=l 



Ku k N = Fp k + z, u k N = ^u k N ^i 
Similarly, if we define 



i=i 



C = (cji), Cji = / ifiifj dx, i,j G I 
Jn 

then 

n 

Kw k D = Cf k , w k Dyl = 0, i G I b , w k D = J2 ™ k D,i<Pi, 



i=i 



Kw k N = Cf k , w k N = J2 w N,Wi- 



As for the realization of Step 2.6 in Algorithm 5.1, let Q a d = {p £ Q \ P > a.e. in Qq}- 
Then q H G Q^ d <$> Jq 1 q H G M + ° = {gG IR^ | q > 0}. Consequently, the projection operator 
P a d has the form: P a dq H = Jq (maxjjg 1 q H , 0}) for any q H G Q H . 

The stopping criterion is as follows. Assume the measurements on data are polluted by 
noise with noise level 8 > 0. Then the stop criteria is 

||/ +1 -/||q<A^ (5-4) 

for some constant \x > 1. The value of fi affects the iterative times and the accuracy in the 
reconstructed solution. 



5.2 Numerical examples 

We reconstruct light source function based on Problem 4.6 by applying Algorithm 5.1. The 
computational results presented here are performed by using a M ATLAB code in a Dell OP- 
TIPLEX GX280 (32-bit-capable 3.00GHz Pentium 4 CPU, 256MB of RAM). In all tests, we 
reconstruct the source function solution for different arguments including regularization pa- 
rameter e, meshsize h, noise level S and parameter [i. We note that all these parameters affect 
the accuracy of the approximate source function and the iterative number in Algorithm 5.1, 
etc. Many references, e.g. [5, 15, 23], can be consulted for a proper regularization parameter 
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Figure 1: Sample Delaunay triangulations 



e in the presence of noise. In our examples, we take the regularization parameter e = 10 7 . 
We use 



u + 2 Dd v u = on T, 



(5.5) 



as the boundary condition for the PDE (2.1), which is resulted from the condition that the 
experiments are implemented in a dark environment. Then we take 



g = —Dd v u on F 



(5.6) 



for the measurement on the boundary. From (5.5) and (5.6), we have g\ = 2g and g 2 — ~9- 
Let g, then g 1 and g 2 , be polluted by noise with level 5 = 10%. The admissible set is taken 
to be Q a d = {q G Q \ q > a.e. in Q }. We use Delaunay elements for the triangulations 
{%i}h of the problem domain Q and {%,h}h of the permissible domain Qq, and assume they 
are consistent so that H = h. See Figure 1 for examples of Delaunay triangulations. Denote 
by Pz ,h the reconstructed approximate light source and by E = \\p^' h — p\\q/\\p\\q the relative 
error in L 2 -norm. 

In the first example, we let the problem domain Q=(0, 10) x (0, 10), and the absorption 
and reduced scattering coefficients /i a and fi' s be constant in the whole domain with values 
0.040 and 1.5 respectively. For our simulation, we take p = 1 pW for the true light source 
in 0, = {(7.5,8.75) x (2.5,3.75)} U {(7.5,8.75) x (6.25,7.5)}, and solve the equation (2.1) 
with boundary condition (5.5) to get the state u by the FEM in a triangulation with a small 
meshsize. In this example, we choose h = 0.1213 with 91136 elements and 45889 nodes in Q 
for a triangulation with a small meshsize. Then from (5.5) and (5.6), we set g = u/2 for the 
measurement g and thus g\ and g 2 (with noise) are obtained. Take the permissible domain to 
be fi = ^*- 
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The reconstructed source 




The state u corresponding to the left light source 
10 « 




x(mm) 



Figure 2: Left: p h e ' h — p in Q ; right: u h corresponding to p 



i i.i i 

e 



Table 1: Iterative number, computation time and relative error E for different p and for 
meshsize h = 0.3750 in the first example 



p 


0.175 


0.17 


0.15 


0.10 


0.05 


0.01 


0.005 


0.001 


iter-num 

cpu-time 

E 


161 
30.61 s 
55.35% 


39 

7.95 s 
26.25% 


9 
2.19 s 
11.88% 


4 
1.05 s 
9.10% 


3 

1.03 s 
6.49% 


2 

0.84 s 

14.08% 


1 
0.66 s 

18.43% 


1 
0.66 s 
19.69% 



We reconstruct source function solution p h e ' h for different parameter p and meshize h. 
Because the norm of the operator S e is difficult to compute, it is not easy to give an upper 
bound for p, not to mention the best p. We can take the smallest value p max from those p 
which make the corresponding sequence {||p fc — p fc ^ x || 2 j-fc nondecreasing during the iteration as 
an upper bound for p. In this test, a value near 0.05 for p appears to be an advisable choice for 
a good reconstruction. As for the stopping criterion (5.4), we take p = 5. We plot p^' h —p and 
the corresponding state of this reconstructed light source function for p = 0.05 and h = 0.3750 
with 5696 elements and 2929 nodes in Figure 2. We show the effect of the parameter p on 
the iterative number, the computation time, and the accuracy of the regularized approximate 
source in Table 1. The dependence of these terms on the meshsize h is provided in Table 2. 



Table 2: Iterative number, computation time and relative error E for different meshsize and 
for p = 0.05 in the first example 



h 


1.2646 


0.6767 


0.3750 


0.2133 


iter-num 
cpu-time 

E 


2 
0.16 s 
13.63% 


3 
0.39 s 
10.91% 


3 

1.03 s 
6.49% 


4 
5.86 s 
3.89% 
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The exact light source function 




The state corresponding to the exact light source 

n 




0.3 



0.2 



0.8 1 1.2 

x(cm) x(cm) 

The reconstructed light source functionThe state corresponding to the reconstructed light source 

|0.3 




n 



i 



0.8 
0.6 
0.4 
0.2 




y 

light i 

n 



■0.2 



0.1 



x(cm) 



Figure 3: Left: p and p^ ,h in Q Q ; right: corresponding state u and u h 



Table 3: Iterative number, computation time and relative error E for different p and for 
meshsize h = 0.1336 in the second examp 



p 


1.1 


1.0 


0.5 


0.1 


0.05 


0.01 


0.001 


iter-num 


71 


16 


3 


8 


8 


1 


1 


cpu-time 


12.30 s 


3.20 s 


1.02 s 


1.61 s 


1.86 s 


0.69 s 


0.69 s 


E 


10.30% 


3.51% 


0.53% 


1.28% 


3.27% 


7.09% 


7.22% 



In our second example, let Q be a circle located at origin with radius 2. Let Q\ — {(x, y) 6 
£1 | y/ ' x 2 + y 2 < 0.6}, fi 2 = {(x,y) E Q | 0.6 < y/x 2 + y 2 < 2}, and let the absorption 
and reduced scattering parameters be 0.09 and 2.3 in Oi, and 0.10 and 1.8 in Q 2 - For the 
measurements g\ and g 2 , we place a light source with formulation p = 1 — 10(x — l.l) 2 — 4 (y — 
0.45) 2 in the domain 



n* 



{{x,y) E Q, I (x - l.l) 2 /0.2 2 + (y- 0.45) 2 /0.5 2 = 1} 



and take the restriction on the boundary of the corresponding approximate state function 
obtained by the FEM for meshsize h = 0.0431 with 80384 elements and 40465 nodes as 2g. 
Again, fio — &■*, and we use \x = 4 in (5.4) for the stopping criterion. We plot the true light 
source density distribution p and an approximate one p^ ,h as well as their corresponding state 
in Figure 3 for p = 0.5 and h = 0.1336. Again we show the effect of the parameters p and 
h on the iterative number, the time our compute costs and the accuracy of the regularized 
approximate source in Tables 3 and 4. 
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Table 4: Iterative number, computation time and relative error E for different meshsize and 
for p = 0.50 in the second example 



References 



h 


0.4667 


0.2387 


0.1336 


0.0758 


iter-num 

cpu-time 

E 


1 
0.13 s 
2.16% 


2 
0.42 s 
0.86% 


3 

1.02 s 
0.53% 


3 
4.36 s 

0.41% 
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ABSTRACT 

For the problem given by x q Ut — {x 1 u x ) x = u p for < x < a, < i < T < oo, u (x, 0) = uq (x) for < x < a, 
and u (0, t) — — u{a,t) for < t < T, where q > 0, 7 € [0,1), p > 1, and uq (x) is a nonnegative 
function for < x < a, this paper studies existence and uniqueness of the classical solution u of the problem. 
Furthermore, the blow-up set of the solution is investigated. 
Key words: Degenerate parabolic problem; Comparison Theorem; Classical solution; Eigenf unction; Blow-up 

1. INTRODUCTION 

Let T < 00, q, 7, p, a be constants such that q > 0, 7 € [0, 1), p > 1, a > 0, D = (0, a), CI = D x (0,T), 
5 = [fl,o],fi = 5x [0,T), dQ = (5 x {0}) U ({0,a} x (0,T)), and Lu = a:<»u t - (a; 7 ^- The following 
degenerate semilinear parabolic first initial-boundary value problem is studied, 

Lu = u p in O, (1) 

u (x, 0) = u (x) on D,u(0,t)=0 = u (a, t) for i G (0, T) , (2) 

where uo (2;) is a nonnegative function such that uo (0) = = uq (a) and uq {%) € C 2+ " (Z)) for some 
a € (0, 1). The study of the problem (l)-(2) is motivated by the research papers of Chen, Liu, and Xie [5], 
and Floater [7]. Chen, Liu, and Xie studied the blow-up set of the problem (l)-(2) with a nonlocal source 
term L u p dx. They showed that u blows up in a finite time and the blow-up set is D. When 7 = 0, Floater 
studied the blow-up set ofuifl<p<<7 + l and uq (x) satisfies the condition -4- (uo (x) /x) < in D. 
He showed that if the solution of the problem (l)-(2) blows up in a finite time, then it blows up only at 
x = 0. When p > q + 1, Chan and Liu [4] proved that x = is not a blow-up point if uq (x) satisfies the 
condition u'q + Uq > Kuq in D for some positive constant K. In addition, they showed that the blow-up set 
is a compact subset of D. 

Without the source term u p , the problem (1) can be used to illustrate the heat conduction in a rigid slab 
whose faces at x — and x — a are in contact with a heat reservoir (cf. Day [6]). x q and x 1 are the heat 
capacity and the thermal conductivity of the slab, respectively. 
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When 7 = and q = 1, the problem (1) can describe the temperature u of the channel flow of a fluid with 
a temperature-dependent viscosity in the boundary layer (cf. Chan and Kong [3], and Ockendon [11]); here, 
x and t denote the coordinates perpendicular and parallel to the channel wall, respectively. When 7 = q, 
(1) is transformed into 

/ — '■y p 

X 

The behavior of the operator £,u — u t — u xx — "fu x /x was studied by Alexiades [1], and Chan and Chen [2]. 
In Section 2, we study existence and uniqueness of the classical solution of the problem (l)-(2). In Section 
3, we assume that uq (x) satisfies the following condition, 



xu' — uq < in D. 
Using an approach different from Floater, we show that u blows up only at x — when 1 < p < q + 1. 



(3) 



2. EXISTENCE AND UNIQUENESS OF THE SOLUTION 



Firstly, we prove a comparison theorem. 

Lemma 1. For any r G (0,T) and bounded nonnegative function B (x,t) on D x [0, t], if u and 

veC(Dx[0, r]) n C 2 - 1 {D x (0, r]), and 

(L- B)u>(L- B)v in Dx (0, r] , 

u > v on the parabolic boundary (Z) x {0}) U ({0, a} x (0,t]) , 

then u>v on D x [0, r]. 

Proof. Let w = u — v + e [l + x' 1-7 " 2 ] e ct where e and c are positive real numbers. Then, w > on 
(D x {0}) U ({0,a} x (0,t]). By a direct computation, 



(L - B) w = {L-B) (u - v) + {L-B)e 



l + x 



(l-7)/2 



>se c 



1+ 2.(1-7)72 



(cx* -B)+[ X —A ) x(- 3 " : -' 



As x — > 0, xS 3 +t)/ 2 — > 00. Let fci = max^^g^xp T ] B 7 and si denote the positive root of 



l- 7 



,(-3+7)/2 



1 + X 



(l-7)/2 



fci =0. 



Then, 



If Si < a, we choose 



(L - B) w > for (a;, i) e (0, si] x (0, r] . 
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Therefore, 

(L - B) w > in D x (0, r] . (4) 

Suppose that w < somewhere in D x (0, r], then the set 

{t € (0, r] : w (xo, i) < for some xo € -D} 

is nonempty. Let £ denote its infimum. Since w (x, 0) > on Z), < t < r. Let x denote the smallest iefl 
such that w (x, t) = 0. We have Wt (x, i) < 0. At i, u; attains its minimum at x, it follows that w^ (x, t) = 
and w xx (x, t) > 0. Therefore, at (x, i) 

(L — B)w (x, i) = x 9 w; t (x, i) — x 7 w xa: (x, i) — ■yx y ~ 1 w x (x, t) — B (x, i) u> (x, i) < 0. 

It contradicts (4). Hence, w > on D x [0, r]. As e — > + , u - v > on D x [0, r]. D 

Let (x) = x v (a — x) v where v G (0, 1) and v + 7 < 1, /i be a positive constant such that 

ho9 (x) > uo (x) on £>, (5) 

S be a positive constant less than a/2 such that there exists some to for which the initial value problem, 

[a- 6} h p (t) 
ti (t) = ^ J— for t € (0, t ] , h (0) = fto, (6) 

has a unique solution, and 

C - ^ p (a - 5) hP' 1 (t) > for t e (0, t ] , (7) 

where 

C = min \v (1 - 7 - 1/) <F+"- 2 (a - «)" , 1/ (1 - v) (a - sY^ S' y - 2 \ . 

Let il){x,t) = 8(x)h(t), lj = D x (0,t ], w = £> x [0,£ ], and <9w = (Z5 x {0}) U ({0,a} x (0,t ])- By the 
construction, ti (t) > for t e (0, t ] and V (a;, t) € C {Co) n C 2 ' 1 (w). 

Lemma 2. r/> (x, t) > u (x, t) on a). 

Proof. By (5), ip (x, 0) > u (x) on L>. Also, V (0, t) = and ip (a, t) = for t > 0. A direct computation 
gives 



Lip-ip p 

'n -tY h' (t\ -uh(t\ 

dx 



x q+v (a - x) v ti (t) - 1//1 (t) -^ x^" 1 (a - xf - x 7+,y (a - x) 1 



- x" p (a - xf p tiP it) 

= x« +,y (a - xf /i' (t) + v (1 - 7 - v) x' t+v - 2 (a - xf h (t) 

+ v (7 + 2v) x' 1 ^- 1 (a - x)^ 1 /i (t) + v (1 - 1/) x 7+J/ (a - xf~ 2 /i (t) 

- x vp (a - x)" p tiP it) . 
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When (x,t) e (o,5 x (0,t ], by (7) we have 



LiP~ij p >v{l- 1 -v)5 1 



+v-2 



-5) h{t)-5 vp (a-5) K p (t) 



Up 



>h{t) C - S up [a - 5) hP- x (t) 
>0. 

Similarly, when (x,t) € a — 5, a) x (0,to], 

LV - V > v (1 - v) (a - 5) 5 v - 2 h (t) - [a - 5) 5 vp h p (t) 
>h(t) \C - 5 vp (a - 5\ VP h p - 1 {t) 
>0. 

When (x,t) £ (S,a-d) x (0,t ], by (6) we obtain 

Lif, - f> > £<?+" ( a - 3)" ft' (t) - ^ ( a _ x f p h p (t) 
> <5 9+2,y /i' (t) - (a - 5) h p (t) 
= 0. 

By Lemma 1, ip (x,t) > u (x, t) on u. 



D 



To prove existence and uniqueness of the solution of the problem (l)-(2), let p (x) G C 1 (-D) be a nonde- 
creasing function such that p (x) = when x < and p (x) — 1 when x > 1. Let (5 be a positive real number 
such that J < a/2. We also let D5 = (5, a),Wj=fljX (0, t ], -D5 = [<5, a], and (D5 = Z),5 x [0, to]- In addition, 
let doj s = (D, x {0}) U ({(5, a} x (0,i ]), 



ps (x) = < 



P(f-l) 



x < 5, 

5 < x < 25, 

x > 25, 



and 



We note that 



d_ 
dS 



U<38 (x) = < 



uos (x) = ps (x) u (x) . 

0, x < 5, 

-fz P'(f -l)u (x), <5<a;<2<5, 

0, x > 2<5. 



Since p (x) is nondecreasing, it follows that 



85 



uos (x) < 0. 



(8) 
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As < ps (x) < 1, we have 

Uo (x) > Uos (x) for x € Us- 

Now, let us consider the following initial-boundary value problem, 



Lus = u p s in us, 



(9) 



us (x, 0) = uos (x) on Ds, us (S, t) = = us (a, t) for t € (0, to] 



(10) 



We want to show that the problem (9)- (10) has a classical solution us converging to a classical solution u of 
the problem (l)-(2) when S tends to 0. 

Lemma 3. The problem (l)-(2) has a unique nonnegative solution 

ueC(u)n C 2+a > 1+a/2 ((0, a] x [0, to]) . 

Proof. As ip (x, t) > us (x, t) on dus, by the maximum principle ip (x, t) > us (x, t) > on ug. We note 
that x-i+~< and a^+T- 1 e C a ' a / 2 (u s ) for some a E (0, 1). x~ q u p s < ipP/S q for some (x,t,u s ) e Us x R. It 
follows from Theorem 4.2.2 of Ladde, Lakshmikantham, and Vatsala [9, p. 143] that the problem (9)-(10) 
has a unique solution us € (j^+aS+a/2 (j-, g y jf ( j 1 > § 2 > o, it follows from (8) u^ < ug 2 in w^. Therefore, 
lim^^o us exists for all (x,t) € u). Let u(x,t) = limg^o ug (x,t) where -0 > it > on <D. We want to show 



biM 



[0, fi] and E = 



61,62 



[0,ii 



that u is a solution. For any (a?i,ti) € w, there exist sets E 

such that {x\,t\) E E d E d u (where 61 > 61 > and t\ < t\ < to)- Since u,5 < ■)/; in _E, we have for any 

constant q > 1, 



i- INH^) < ||VIIl«(A) 
ii. For t > 0, 



< /C2 for some positive constant k%, 






£«(-«+T-i)+i _ l«(-9+7-1)+i 

°2 "1 



1/8 



[«(-? + 7-l) + l] 



1/8 



-in- 



tends to when r approaches 0, 

hi. ||a5-Mllw(£)<6r 9 ll^llw(A)- 
If we choose 5 > 3/ (2 — a), by Theorem 4.9.1 of Ladyzenskaja, Solonnikov, and Ural'ceva [10, pp. 341-342] 



u s e W-' 1 (E). By Theorem 2.3.3 there [10, p. 80], W- 2 ' 1 (e) ^ #«.«/2 Lb). Thus, ||iti|| He ,, a/2 (£) < fc : 
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for some positive constant k%. Now, 

I I t, — <?„.?> I I 

\r u s\\h"^/ 2 (e) 



< b i \m\oo + sup 



(x,t)EE \ X X \ 



(x,t)EE 

\u p s (x,t)\\x-»-x-«\ x-d\u p 5 (x,t)~u p s {x,t)\ 
+ sup - - a h sup — ^ . 

(x,t)eE \ x x \ (x,t)eE \t~A 

(x,t)eE (x,i)eE 

By the mean value theorem, we have 

I |*-*u?l !*«,-/*(£) < K q IIVC +PbI q IMC'lMU^) + MIL \\x- q \\ Ha , a n { E)<^ 

for some positive constant k^ which is independent of 5. In addition, ||x ~ q+ ~'\\ Ha , a /2( E \ and \\jx ~ 9 + 7 _1 L fc , ]C ,/ 2 fp , i 
are bounded. Then, by Theorem 4.10.1 of Ladyzenskaja, Solonnikov, and Ural'ceva [10, pp. 351-352], we 
have 

\\ U 5\\ JJ2+a,l+a/2(ji;\ S ™5 

for some positive constant k§ which is independent of S. This implies that u$, (ug) t , (u$) , and {us) are 
equicontinuous in E. By the Ascoli-Arzela theorem, ||u||^ 2 +a, 1+0/2 ( E \ < &5, and the partial derivatives of u 
are the limits of the corresponding partial derivatives of us- Since tp > u > on Co, by the Sandwich theorem 
u(0,t) =0 = u(a,t) forte [0,t ]- Thus, ue C (w) n C 2+a ' 1+a/2 ((0, a] x [0,t ]). By Lemma 1, there exists 
a unique nonnegative solution u to the problem (l)-(2). D 

Let T = sup{*o : the problem (l)-(2) has a unique nonnegative solution u € C (Co)nC 2+a ' 1+a / 2 ((0, a] x [0, to])}- 
A proof similar to that of Theorem 2.5 of Floater [7] gives the following theorem. 
Theorem 4. The problem (l)-(2) has a unique nonnegative solution 

ueC (fi) n C 2+a ' 1+a/2 ((0, a] x [0, T)) . 

If T < 00, then u is unbounded in f2. 

3. BLOW-UP OF THE SOLUTION 

Let <f> (x) be the fundamental eigenfunction of the following Sturm-Liouville eigenvalue problem, 

(x 7 4>')' + \x q 4> = in D, 4> (0) = = <f> (a) , (11) 

where A is its corresponding eigenvalue. From the result of Chen, Liu, and Xie [5] , A > and 

, , s , 1=2 T I 2%A 1+2-7 \ 

(p (x) = k 6 x 2 J i- 7 — — a; 2 

9+2-7 I q + 2 - 7 / 
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where J(i_ 7 )/( g _|_2-7) ( z ) is the Bcsscl function of the first kind with order (1 — 7) / (q + 2 — 7), and </> (x) is 
positive in D for some positive constant k$. We choose ke such that max^gjj (/> (x) = 1. 

Let U (t) = J Q x q cf)udx. We modify the proof of Theorem 8 of Kaplan [8] to obtain the following blow-up 
result. 

Lemma 5. // U (0) > (Aa P9 ) '^ p , i/ien u Wows up m a finite time. 

Proof. Multiply 4* (x) on both sides of (1), we obtain 

x q u t (f) = {x 1 u x ) x 4> + U P (j). 
Integrate the above equation with respect to x from to a, and by (2) and (11) we get 

( f x q u4>dx\ > I (zV)' udx + J u p (-Y 4>dx 

f a f a ( x q \ p 

> — / Xx q (budx + / — d>u dx. 

~ Jo Jo \ aq J 

By Jensen's inequality, 

x q u<pdx > —A / x q 6udx + / — dmdx 
Jt Jo \Jo aq J 



This inequality is equivalent to 



rl .o-Mp- 1 )* 

^ (e^U) > e —— (e^Uf . 
dt y ' ~ aP q v ; 



Integrate the above expression from to t, it yields 

{e xt U(t)Y P+1 - U-p +1 (0) < ^— q |, Ai ! " : " - I 
which implies 



e A(p-i)t_ 



u l - p {t)<^— + (u l - p (0) - -?—) 

v ; ~ Xd pq \ v ' \aP q J 

By the assumption, U l ~ p (0) < 1/ (Xa pq ). Thus, U (t) tends to infinity in a finite time which implies that 
u (x, t) blows up in a finite time. □ 

Let 

I = xu x -u + ex r u m , (12) 

where r, to, and s are positive real numbers such that: 

(a) r > q + 2 - 7, 

(b) 1 < to < p, 
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(c) let e be a positive real number and 

(r — q — 2 + 7) (r + 7 + m) + r (m — 1) p — m 



e < min 



ma r (2r - q - 2 + 7) ' mai+ r - 2 (2r - q - 2 + 7) 

(d) by (3), we also choose e satisfying 

xuq — uq + ex r u™ < for x € D. 

Lemma 6. If p < q + 1 and no (x) satisfies (3), then I < on f2. 

Proof. Since lim^^o i«i = 0, lim x _>o -^ (#> £) = for t € (0,T). Define 7(0, t) — \im x ^o I (x , t) . Since 
u (x, t) > in Q and w (a, t) = 0, it follows that u x (a, t) < for t € (0, T). Then, I (a, t) < for t e [0, T). 
By the condition (d), I (x, 0) < for x <G D. Differentiate (12) with respect to i, it gives 

It — xu xt — ut + emx r u m ~ ut- (13) 

Similarly, 

Ix = xu xx + smx r u m ~ 1 u x + erx r ^ 1 u m . (14) 

By (12), this expression is equivalent to 

I x =xu xx + emx r - l u m - l I + e(m, + r)x r - l u m -e 2 mx 2r - 1 u 2m -\ (15) 

Differentiate (14) with respect to x, it yields 

Ixx — xu xxx + (l + £TOx r u m ~ 1 ) u xx + 2ermx r ~ 1 u m ~ 1 u x 
+em (m — 1) x r u m ~ 2 (u x ) + er (r — 1) x r ~ 2 u m . 
According to (13), (14), and (16), we have 

x*I t ~ (x<I x ) x 

= x (x q u xt — x 1 u xxx ) — x q u t + emx r u m ~ x q u t — x 1 u xx — emx r+1 u m ~ u x 
- 2srmx' l+r - 1 u m - 1 u x -em{m- 1) x~< +r u m - 2 (u x ) 2 - er (r - 1) x^ +r - 2 u m - 7 x 7 w a 



L XT 



- e 1 mx' t+r - 1 u m - 1 u x - e-irx'< +r - 2 u m . 

Differentiate (1) with respect to x, we obtain 

x q u xt -X 7 u xxx = -qx q ~ x u t + 2jx 1 ~ 1 u xx +7(7- l)x"<~ 2 u x + pu p ~ 1 u x . 

Using this expression and condition (b), we have 

x q I t - {x~<I x ) x 

< — (q + 1) x q u t + emx r u m ~ x q u t + (7 — 1) x 1 u xx + 7 (7 — 1) x 7 ~ u x + pu p ~ xu x 
- emx r+ ^u m - l u xx -e(2r + 7) mx' l+r - 1 u m - 1 u x - er (r - 1 + 7) x^ +r - 2 u m . 



(16) 
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From (1), x q u t = x~*u xx + 7a; 7 u x + u p . Then, 



x"I t ~ (X^I X ) X 



<-(q + l) (x~<u xx + 7a; 7 l u x + u p ) + 



emx u 



r „ m - 1 (rrl 



(x 1 u xx + 7X 7 1 u x + u p ) 



+ (7 - 1) x 1 u xx + 7 (7 - 1) x 1 1 u x + pu p 1 xu x - smx r+1 u m x u xx 



e (2r + 7) mx 1 u m u x — er (r — 1 + 7) x 



y+r-2 u m 



We simplify the above expression, and according to (12), it yields 

x"I t - (x^I x ) x 
< - (q + 2 - 7) x^u xx - 7 (q + 2 - 7) a; 7 " 1 ^ - (q + 1) m p + emx r u m+p - 1 



r - 2 u m . 



+ vu p - x (J + u - ex r u m ) - 2emrx t+T '- 1 u m - 1 u x - er (r - 1 + 7) x 7+ 

By (15) and (12), the above inequality changes to 

x*I t - (x~<I x ) x 

< - (q + 2 - 7) a; 7 " 1 [4 - emx r - l u m - 1 I -e{m + r) x r - l u m + e 2 mx 2T '- 1 u 2m - 1 ] 
- 7 (q + 2 - 7) x 7 " 2 (J + u - ea: r u m ) - (q + 1) u p + emx r u m+p - 1 + pu p - l I + pu p 



ipx r u m+p - 1 - 2emrx' 1+r - 2 u m - 1 (I + u - ex r u m ) - er (r - I+7) a; 7+ 



r - 2 u m . 



By assumption p < q + 1, it gives 

x«I t ~(x~<I x ) x + (q + 2- 1 )x-<- 1 I x 

< -em (2r + 7 - 9 - 2) x 7+r - 2 u m - 1 / - 7 (5 + 2 - 7) a; 7 " 2 / + pu p - x I 
—ex 1+r ~ 2 u m [(r — q — 2 + 7) (r + 7 + m) + r (m — 1)] 
+£ r2 ma; 7 + 2r '- 2 ii 2m - 1 (2r - g - 2 + 7) - e (p - m) x r u m+p - 1 . 

When < u < 1, — u 2 " 1-1 > — u m for m > 1. By condition (a), it yields 



£ 2 ,7+''-2„m 



w m [{r - q - 2 + 7) (r + 7 + to) + r (m - 1)] 



(17) 



+ e 2 mx~< +2r - 2 u 2m - 1 (2r - q - 2 + 7) 

< -ex 7+r_2 ii 2m_1 [(r - q - 2 + 7) (r + 7 + m) + r (to - 1) - ea r m (2r - g - 2 + 7)] . 

Choose e such that e < [(r — q — 2 + 7) (r + 7 + m) + r (m — 1)] / [a r m (2r — 9 — 2 + 7)], then (17) becomes 

x"I t - (x 7 4) x + (g + 2 - 7 ) x^- x I x 

(18) 

< -em (2r + 7 - q - 2) af+^u™- 1 / - 7 (9 + 2 - 7) x^ 2 I + pw^- 1 /. 
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When u > 1, u 2m_1 < w m +p- 1 for m < p. By condition (a), it gives 

e 2 mx' l+2r -' 2 u 2m - 1 (2r - g - 2 + 7) - i (p - m) x r u m+p - 1 
< ex r u m+p - 1 [ea? +r - 2 m (2r - q - 2 + 7) - (p - m)] . 

Choose £ such that £ < {p — m) / \a y+r ~ 2 m (2r — g — 2 + 7)] , we have 

E 2 m^ +2r - 2 « 2m - 1 (2r - q - 2 + 7) - i (p - m) /u"*^- 1 < 0. 

Therefore, (17) reduces to (18). Hence, if e satisfies condition (c), then (18) is true for u > 0. 

Let J = I — j]eJ 3t where 77 and /? are positive real numbers. Then, J (x,t) < on dfl, Jt = It — rifle"*, 
and J x — I x . From (18), we have 

x" (j t + V (3e fjt ) - (x~<J x ) x + (q + 2 - 7) x 7 " 1 J x 
< -7 (q + 2 - 7) x<- 2 (J + rie l3t ) + pvP' 1 (J + rje^) 
- em (2r - q - 2 + 7) x^- 2 !!™" 1 ( J + rje 13 *) . 

For any r £ (0,T), let M — max^ t)eDx\o r] u - By condition (a), we obtain 

x q J t - {x'< J x ) x + (g + 2 - 7) x 7 " 1 J, + 7 (g + 2 - 7) x 7 ~ 2 J - pvP- x J 
+ em (2r - g - 2 + 7) a^-^" 1 - 1 J 
< rye* 3 ' [-/3a; 9 - 7 (g + 2 - 7) x 7 ~ 2 + pM^ 1 ] . 

As x — > 0, a; 7-2 — ► 00. Let S2 denote the positive root of 

-7(g + 2-7)ic 7 - 2 +]9M p - 1 =0. 

If S2 < a, we choose /3 such that 

/3> — ^— . 

Therefore, for (x, £) € O 

> x« J f - (x 7 J x ) x + (g + 2 - 7) x 7 " 1 J x + 7 (g + 2 - 7) x 7 ~ 2 J - pu^ 1 J 
+ em (2r - g - 2 + 7) x 7+, '- 2 u" 1 - 1 J. 

Suppose that J > somewhere in fl, then the set 

{t e (0, T) : J (x 2 , i) > for some x 2 e D} 
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is nonempty. Let t denote its innnium. Since J (x, 0) < on D, < t < T. Let x 3 denote the smallest 
x G D such that J (x 3 , t) ~ 0. We have Jt (x 3 ,ij > 0. At i, J attains its maximum at x 3 , it follows that 
Jx (x 3 ,i) ~ and J xx (x 3 ,f) < 0. Therefore, at (X3, t) 

> x q 3 J t (x 3 , i) - xjj xx (x 3 ,i) + (q + 2 - 27) xj~ J x (x 3 ,i) 
+ 1 (q + 2 - 7) X T 2 'J (x 3 ,i) -p(u(x 3 ,i)) P J(x 3 ,t) 
+ im(2r-q-2 + 7) xj +r ~ 2 (u (x 3 , t)) m-1 J (x 3 ,t) 
>0. 

It leads to a contradiction. Therefore, J < on Cl. As 77 — > + , / < on f2. D 

Theorem 7. If p < q + 1 and uq (x) satisfies (3), then x — is the only blow-up point. 
Proof. According to Lemma 6, xu x — u < —ex r u m on f2. It implies 

x~ Hj; — .t~ u < — ex r ~ u m . 

It is equivalent to 

4- (x-'u) < -ix r+m - 2 (x-'u)™. 
ax 

Let Xi be a positive real number in (0, a]. For x G (0,X4), we integrate the above expression from x to 2:4, 

it gives 

(x^u(x 4 ,t)y m+1 - (x- l u{x,t))~ m+1 .(x^™" 1 -x r + m ~ 1 ) 



rn — 1 r + to — 1 

Suppose that w blows up at 0:4, then u (x^, t) — > 00 as t — > T _ . The left hand side of the above expression 
tends to a non-positive real number. However, the right hand side is a positive real number. It leads to a 
contradiction. Hence, x — is the only blow-up point. □ 
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Abstract 

As is well-known, the Gibbs' overshoot in approximating a step function is ir- 
reducible when using conventional Shannon sampling. Here, we consider a general- 
ization of Shannon sampling which allows samples to be taken on non-equidistant 
points, adapted to the behavior of the function. We show, numerically, that the new 
method allows one to reduce the Gibbs' overshoot. In a concrete example, the am- 
plitude of the overshoot is reduced by 70%. We study the underlying mathematical 
structure with a view to eventually determining the ultimate bound on how far the 
Gibbs' overshoot can be reduced. 
Key Words: Sampling, Shannon, Gibbs, Self-Adjoint Operators, Step Function. 

1 Introduction 

The Shannon sampling theorem was introduced into information theory by Shannon 
in 1949 [1]. It has since played a crucial role as the link between continuous and 
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discrete representations of information, finding ubiquitous use in communication en- 
gineering and signal processing. Already before Shannon, the theorem was studied 
by E. Whittaker and J. Whittaker in 1929, and it was also independently studied in 
the Russian literature by Kotel'nikov in 1933. Hence, it is also called the theorem of 
Whittaker-Shannon-Kotel'nikov (WSK). For a review, see [2, 3, 4], 

The Shannon sampling theorem states that if a function cj>(t) is O-bandlimited, 
i.e., 4>{t) has a frequency upper bound O, then cf>(t) can be perfectly reconstructed at 
all time t from its sample values {</>(£«)}„ taken on a set of sampling points {t n } n 
with an equidistant spacing t„+i — t n — 1/(20) via: 

oo 

4>{t)= E G(Mn)0(£n) (1) 

n — — oo 

The function G(t,t n ) is the so-called reconstruction kernel, and in the case of Shan- 
non, it is the shifted sine function sinc(20(£ — £„)) 'centered' at t n . The frequency 
upper bound O is called the bandwidth, and the sampling rate 1/(20) is referred to 
as the Nyquist rate. 

In addition to its use for the perfect reconstruction of functions in the space of 
O-bandlimited functions, the Shannon sampling theorem has also been widely used 
to approximate non-bandlimited functions. However, in this case, the Gibbs' phe- 
nomenon occurs whenever there is a discontinuous jump point leading, for example, to 
the Gibbs ringing problems in image compression. The clearest example to illustrate 
this type of overshoot is the step function H(t). See Figure 1. 



H(t) 



1 t > 

£ = 

-1 t < 



In Figure 1, the step function H(t) is approximated using Shannon sampling, i.e., 
as a sum of shifted sine functions. The equidistant sampling points are at integer 
multiples of the constant spacing As = 1/(20). Although the sampling density on 
the right (As = 0.1) is ten times tighter than the one on the left (As = 1.0), the 
maximum values of both approximating functions are about 1.0664 with an error 
of 0.001. We used 1000 terms in (1) in both cases. As is well known, the 6.64% 
difference to the original step function H(t), i.e. the Gibbs' overshoot, can not be 
further reduced when using Shannon sampling, even when increasing the bandwidth. 
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Figure 1 : Approximations of the step function by Shannon sampling. The left panel uses a wider 
sampling spacing of 1.0, while the right panel uses 0.1. 



In this paper, we use a generalized sampling theory [5, 6, 7] which allows the 
sampling and reconstruction of a function on a set of non-equidisant points adapted 
to the behavior of the function. We show that the new sampling method displays 
advantages when approximating a step function. See Figure 2. 

In Figure 2, we approximated the step function using the generalized sampling 
method. Outside the interval [—10, 10], the sampling points have the same constant 
spacing As = 1.0 as the ones on the left in Figure 1. But in the neighborhood interval 
[—10, 10] of the jump point, we adjusted the sampling density with 20 extra sampling 
points. As a result, the maximum value is reduced to 1.0193, which is a 70.9% of 
reduction in the amplitude of Gibbs' type of overshoot. The amplitude is subject to 
a numeric error of 0.001, which implies an error of 0.1% in the reduction percentage. 

The plot on the right in Figure 2 is a zoom-in of the plot on the left near the 
jump point. The solid line on the top is the amplitude of the Gibb's oveshoot on the 
uniform lattice in the case of Shannon (which is 1.0664), and the dashed line indicates 
the amplitude of maximum value with the new generalized sampling theorem (which 
is 1.0193). This indicates that the new method could be very useful, for example, 
to reduce Gibbs ringing in image compression. While we have observed the Gibbs 
overshoot reduction numerically, the analytic reasons and the ultimate limit for the 
Gibbs overshoot reduction still need to be understood. 

In the rest of this paper, we will therefore first recapitulate main features of the 
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Figure 2: Approximating the step function by the generalized sampling method with non- 
equidistant sampling points. The right plot zooms in near to the jump point. 

Shannon sampling theorem in Section 2, followed by the generalization in Section 3 
where we will show that it preserves the key features of Shannon sampling, except 
for the equidistance restriction on sampling points. In the end, we will show how 
to choose the non-equidistant sampling lattice adapted to the behaviour of a step 
function, and to obtain the result in Figure 2 by the generalized sampling method. 



2 The Shannon Sampling Theorem 

In this paper, we call a set of points where we take samples a sampling lattice. The 
Shannon sampling theorem does not specify where we should start to take samples, 
but requires that the distance between two adjacent points in one sampling lattice 
is precisely the constant Nyquist spacing 1/(2Q). Hence, we can parametrize all the 
sampling lattices as: 

*»(*) = ! ^r> o<£<i (2) 

Notice that as the parameter 9 increases from to 1, the sampling points {t n (8)} 

specified by 9 increase simultaneously and continuously with the following continuity 

property: 

t„(l) := lim t„(9) =t„+i(0) (3) 

9->i- 

In addition, we can differentiate the sampling points with respect to the parameter 
9 to obtain the velocity with which the sampling points are travelling along the real 
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line for increasing 9. The derivatives obey a similar continuity property: 

l im ^"W = dt n+1 (8) (4) 

e->i- dO dd e=0 

As a result, all the sampling points together, {t n (9)} n for all integer n and < 9 < 1, 
cover the real line exactly once. 

Of course, given any one-to-one and differentiable function 7 = 7(6) mapping 
[0, 1) onto [0, 1), we obtain a re-parametrization of the sampling lattices: 

Independent of which parameter we use, the one-parameter family of sampling lattices 
associated with the Shannon sampling theorem satisfies the properties mentioned 
above: as the parameter increases, the sampling lattices move to the right along the 
real line in a continuous manner so that, together, they cover the real line exactly 
once. 

For the particular parameterization with 9 in (2), the derivatives are precisely the 

Nyquist sampling rate: 

dt n (0) = _1_ 
dO ~ 20. 
This is not the case for a general parameterization. However, due to the Chain rule, 
the derivatives at a fixed lattice are always proportional, instead of equal, to the 
Nyquist sampling rate l/(2fi): 

dtuj-y) _ dtn(0) d&_ _ J_ dB_ J_ 

d-y ~ dO dy ~ 2fi d-y ~ 20 

This will be important later. 

So far, we saw that the Shannon sampling theorem possesses a natural one- 
parameter family of Nyquist sampling lattices {t n (9)} n . Using the reconstruction for- 
mula (1), any function in the space of f2-bandlimited functions can be reconstructed 
from its values taken on each fixed sampling lattice, namely, on {t„} n — {tn(9)} n 
for an arbitrary but fixed 9. In other words, treating the reconstruction kernel as a 
function in t, the following set of functions 

g ( J>\t) = G(t,t n (0)) 

forms an orthonormal basis of the function space and hence spans the function space. 
Most importantly, for each value of the parameter 9, from to 1, these bases span 
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the same function space. As we will see later, this is not trivial, but achievable when 
we generalize to non-equidistant sampling lattices. 

Further, on one fixed lattice, the basis functions interpolate all the points in that 
lattice: g„ (t m (0)) = 8 nm . More importantly, concerning of stability of reconstruc- 
tion, notice that the maximum value (or the maximum of the absolute value) of these 
basis function is always 1 at the point about it is 'centered'. 

3 The Generalized Sampling Method 

The generalized Shannon sampling theory [5, 6, 7] preserves the key features men- 
tioned in the preceding section, including a one-parameter family of sampling lattices 
(with non-equidistant sampling points in general), which covers the real line exactly 
once, the same form of reconstruction formula as in (1), and the corresponding recon- 
struction kernel (or sets of basis functions) with similar properties as the sine kernel 
(or sine functions). 

This generalized sampling theory was initially motivated by the study of the 
physics of space-time in quantum gravity [8] . The underlying mathematics is based 
on the functional analytical theory of self-adjoint extensions of unbounded symmetric 
operators, which will be briefly mentioned at the end of this section. 

While generalized Shannon sampling is optimized for non-uniformly spaced sam- 
pling points, it is significantly different from conventional non-uniform Shannon sam- 
pling [9]. Conventional non-uniform sampling still works within same bandlimited 
function space as in the case of Shannon, and merely seeks to reconstruct a function 
in that space from samples taken on a set of non-uniform points. In this case, re- 
construction from the set of uniform sampling points is optimally stable among all 
lattices with the same average sample density. The stability of conventional sam- 
pling from non-uniform points can be greatly reduced, as is illustrated by the case of 
the so-called superoscillatory functions [10]. In approximating a step function, since 
the conventional Shannon non-uniform sampling uses the same bandlimited function 
space, the Gibbs' oveshoot can not be made smaller even when using non-uniformly- 
spaced sampling points. 
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Figure 3: Examples of generalized sine functions (or reconstruction kernel) on a random non- 
equidistant sampling lattice 



3.1 The Generalized Reconstruction Kernel 

In the generalized sampling theory, we still deal with one-parameter families of sam- 
pling lattices, say {t n (a)} n - Now, however, they can possess non-equidistant sam- 
pling points. We use the parameter a to distinguish from the parameter 8 used in 
the Shannon sampling theory. Together, the sampling lattices still cover the real line 
exactly once, and they are differentiable with respect to the parameter a. We denote 
the derivative at a point t — t n (a) by: 

The family of sampling lattices still satisfies the continuity conditions (3) and (4). 
On each fixed lattice, i.e. for a fixed a, let t n = t n (a), t'„ = t' n (a). Then the 
reconstruction kernel on this lattice was found to read: 

<*'»> =<-»■"•-' ^(e^)"" m 

where z(t, t n (a)) is the number of the sampling points {t m (a)} m between t and t n (a) 
exclusively. 

As a function of t, for each fixed a, the set of basis functions 



{gl?\t) = G(t,t n (cx))} n 



spans the same function space. Of course, it is generally no longer the space of 
bandlimited functions. These continuous functions have analogous properties to those 
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of sine function: they interpolate all the points in the lattice specified by a 
ffi a) (^m(a)) = G(t m (a),tn(a)) = 8 mn 

and their maximum values are all 1 at the sampling points about which they 'cen- 
tered'. Hence, we refer to these basis functions as generalized sine functions. Sec 
Figure 3 for examples. 

The fact that each set of basis functions specified by a fixed a spans the same 
function space is a remarkable property since, as Figure 3 shows, the shapes of these 
generalized sine functions are quite non-trivial. 

Let {in (7) = tn(&('y))} n be a re-parametrization of {tn(a)} n where a — 0(7) is 

a differentiable and strictly increasing function mapping [0, 1) onto [0, 1). For each 

value of a, there is one and only one value of 7 such that t n (a) = t n (7) for all n. Hence 

{tn(a)} n and {tn("/)} n represent the same sampling lattice, although the derivatives 

with respect to each parameter are different. However, because we are working on 

the same family of sampling lattices, independent of the choice of parameterization, 

we expect the reconstruction kernel on a fix lattice {t n (a)} n = {tn('~f)} n to be the 

same, and hence giving the same function space. This is indeed the case. To see this, 

use the Chain rule: 

dt„(-y) _ dt„(a) da 
d'y da d-y 

We notice that the derivatives on the fixed lattice respect to different parameters 

differ only by an n-independent constant C = 4^. Substitute Ct'„(a) into (5). The 

constant C in the y / i^(a)-term in the numerator in front of the infinite sum cancels 

out the one in t' m (a)-term inside the summation since there is a power of —1/2 over 

the summation. 

Therefore, the reconstruction kernel and the set of basis functions on a fixed lattice 
is invariant under a scalar multiplication of the derivatives, or a re-parametrization 
of the sampling lattices. As we will see later, this simplifies the work on finding 
derivatives for a given lattice. 

To recover the Shannon sampling theorem as a special case, we choose any uniform 
sampling lattice {t n } n with t n +i—t n = 2TT f° r al l n i together with constant derivatives 
t'„ = C. Then the reconstruction kernel in (5) simplifies to the sine kernel sinc(2f2(£ — 
tn)), by using the following trignometric identity: 

n 2 + °° 1 

sin 2 (7^) " f- {z-kf (6) 
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3.2 One Sampling Lattice and its Derivatives 

For practical purposes, one usually only uses one sampling lattice instead of the whole 
family of lattices, given that one lattice is enough for the reconstruction or interpo- 
lation. For example, to approximate a step function, we only need to interpolate 
samples on one fixed sampling lattice. Hence, it is natural to ask if specifying the 
whole family of sampling lattice is equivalent to specifying just one, say {tn(0)} n . 
The answer is yes if the derivatives {t'„(0)} n on that lattice are also specified. 

Indeed without loss of generality, if we are given the sampling lattice and the 
derivatives at a = 0, one can generate the whole family of sampling lattices {t„(a)} 
by solving for t = t n (a) in: 

V tm( -°] r = 7T cot(Tra) (7) 

m 

This equation is derived directly from the theory of self-adjoint extensions [5]. Hence 
it holds for the particular parameterization a that arises in this way. We will not 
further discuss the details, but emphasize the fact that finding one sampling lattice 
and associated derivatives are sufficient to apply the generalized sampling theorem. 
But which {t n } n and {t' n } n can arise in the generalized sampling theory? 

Arising from the theory of self-adjoint extensions, we have to require the following 
two restrictions when choosing a sampling lattice {t n } n - First, there is a minimum 
and maximum spacing, namely, there exist positive real numbers <5 m i n and A max such 
that: 

< <$min < At n = tn+i - t n < A max for all n (8) 

In addition, the sampling points {t„} n and the corresponding derivatives {£„}„ have 
to satisfy: 

With this initial set of data, {£„ = t n (0)} n and {t' n = t' n (0)} , one can use (7) to 
generate the whole family of sampling lattices {t„(a)}, in which each lattice also 
obeys (8) and (9). 

The question now is, once we have a sampling lattice {t„} n obeying (8), what 
would be a suitable choice of the associated derivatives t' n ? 

First, a derivative t' n (a) is the velocity with which the sampling points t n (a) are 
moving to the right along the real line for increasing a at t = t n (a). Hence a good 
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candidate for t' n is the distance travelled in one period of a, which is the spacing 
between two adjacent points. For symmetry, we set t' n to be proportional to the 
average distance between t n to its previous and successive points: 

t' n ~ - (Atn + At„_l) = - (tn+l - *n-l) (10) 

The multiplicative constant can be obtained through (9). However, since the recon- 
struction kernel is independent of a scalar multiplication of the derivatives, we will 
ignore this scalar prefactor, and use t n +i — t n -i directly as t' n in approximating a 
function. 

3.3 Generalizing Principles 

The proof of the generalized sampling theorem is outlined in [5, 6]. A more detailed 
mathematical proof can be found in authors' upcoming paper [7], and the general 
theory of self-adjoint extensions can be found in [11, 12]. Here, we will outline the 
idea of this generalization from the functional analytical point of view. 

In the Shannon sampling theorem, the set of shifted sine functions span the 
space of fi-bandlimited functions. In frequency space, namely, by taking the Fourier 
transform, the function space corresponds to L 2 {[— Q., $7]), which has a dense subset 
AC (— Cl, fi), the set of absolutely continuous functions on [— fi, fi] with a vanishing 
boundary condition. The dense subset AC (— fi, O) is an invariant domain under the 
action of the differential operator D :— i4-'- 

$(w) g Ac°{-n,n) =* d$(cj) = i$'(w) g Ac"(-n,n) 

The operator D is a simple symmetric operator with deficiency indices (1,1), and 
it has therefore a [/(l)-family of self-adjoint extensions. Each self-adjoint extension 
has a set of discrete eigenvalues t n (9) = ^i- , which corresponds the set of sampling 
points. Further, each eigenvalue i„ has an eigenfunction, which corresponds to the 
basis function. 

The differential operator D is the multiplicative operator T, T(j>(t) = t<f>(t), in the 
time domain, which is also simple symmetric with deficiency indices (1,1). In the 
case of Shannon, the operator T is special since all its self-adjoint extensions have 
equidistant eigenvalues. 

In the generalization, we consider a generic symmetric operator T without equidis- 
tant restriction. In functional analytical terms, the sampling theorem expresses the 
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fact that if a Hilbert space vector is known in the eigenbasis of one self-adjoint ex- 
tension of T, then the vector's coefficients in the eigenbases of all other self-adjoint 
extensions of T are also determined. Each of those spectra constitutes a set of in 
general non-equidistant Nyquist sampling points. The sampling kernel then consists 
of the matrix elements of the unitary transformations which map in between the 
eigenbases of the self-adjoint extensions of T, and the function space is the closure of 
the domain of T. 

4 Approximating a Step Function 

In summary, to use our generalized sampling theorem to approximate a function, the 
strategy is: 

1. Find a set of points {t n } n to form a sampling lattice, depending on the behavior 
of the function to be approximated. The sampling points have a minimum and 
maximum spacing (8); 

2. Set the corresponding derivatives {t' n } n to be |(£ n +i — t n -i) as in (10); 

3. Construct the approximating function by the reconstruction kernel (5) and re- 
construction formula (1). 

The difficult task is the first step: given a function, how to choose a suitable set of 
sampling points? 

In approximating the step function H{t), we have seen that uniform lattices are 
not optimal, because the step function has a sudden change at the jump point, i.e. a 
large Gibbs overshoot. Intuitively H(t) has infinitely large bandwidth at t = 0, while 
it has zero bandwidth elsewhere. A windowed Fourier transform shows a regularized 
behavior of high bandwidth at the step and decreasing bandwidth away from the 
step. Recall that in the Shannon case, the constant derivative t' n ~ ^ is inverse 
proportional to the bandwidth Q. Hence, the constant derivatives on a uniform 
lattice are not matched up with the jump in H(t). 

When we take samples within the generalized sampling theorem, the derivative, 
which controls the sampling density, must match the varying bandwidth. Hence, we 
expect to take 

• more samples (at a higher rate) when the function oscillates faster, and 
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• less samples (at a lower rate) in the period of time with less oscillations. 

Thus, for a step function, the spacing between sampling points should be small 
near the jump point t — where the function has a high bandwidth, and gradually 
increases to some constant spacing when we sample far away from t — 0. 

The easiest gradually increasing spacing is the linear one. Hence, we will use the 
sampling lattice with linearly increasing spacing in approximating the step function 
H(t) in this paper. We expect this non-equidistant sampling lattice will give a better 
approximation than the case of Shannon, because now the sampling density matches 
better with the behavior of the step function H(t). 

Because H(t) is anti-symmetric about t — 0, the sampling points should also be 
symmetric with respect to t — 0. Let to = be one of the sampling points, then 
£_fc = — tk for all k > 1. From now, we can therefore focus on finding the positive 
sampling points. 

Assume that outside the interval [— MAs, MAs], we have equidistant sampling 
points with a constant spacing As: 

Sm = mAs V \m\ > M 

where M is some positive integer. Inside this interval, we have non-equidistant sam- 
pling points with linearly increasing spacing. Assume that we have K such positive 
non-equidistant sampling points. Let Ar be the first spacing between to and ti, and 
5 be the linear spacing increment. Then 

ti = t + Ar = Ar 

t 2 =t! + (Ar + 8) = 2Ar + S 

t 3 = ti + (Ar + 25) = 3Ar + 35 



tK-i = ta-i + (Ar + (K - 2)8) = (K - l)Ar + ±(K - l)(K - 2)8 
t K = tK-i + (Ar + (K- 1)8) = KAr + \k(K - 1)8 
The chosen sampling lattice is 

The largest non-equidistant sampling point tx must match the equidistant sampling 
point sm = MAs, and the spacing must gradually increase to the constant spacing 
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As. Hence 

t K = KAr + -K(K - 1)8 = MAs (11) 

{tK~t K -i) + S = Ar + K8 = As (12) 

The difference of 2 times (12) and (K — 1) times (12) gives: 

(K + l)Ar = (2M - if + l)As (13) 

Notice that Ar is (and should be much) smaller than As. By (13), this means that 

(K+l)> (2M -K + l) =* M <K 

The smaller Ar, the better approximation. So we want to choose K as large as 
possible. Re-write (13) as 

K = (2M + DAS-A, = _ 2(M + l) Ar 

As + Ar y As + Ar v ' 

All the terms are positive, so K < 2M + 1, and hence the maximum possible value 
of K is 2M. With K = 2M, we have from (14): 

2 [ M + P Ar = 1 =* As = (2M + l)Ar 
As + Ar v ' 

Substituting this and K — 2M to (11), we have 

Ar + (2M)8 = (2M + l)Ar =>• 8 = Ar 

In summary, for an optimal result to approximate a step function H(t) using this 
linearly increasing spacing method, we pick an interval [—MAs, MAs] from the uni- 
form lattice with a constant spacing As, and replace the uniform sampling points in 
that interval by a set of points whose adjacent spacing is linearly increasing. There 
will be twice as many of points as with the uniform lattice, i.e. K = 2M, and 
8 = Ar = 2j J 1 As. In other words, only the M and As are free choices. 

In Figure 2, the step function H(t) is approximated by this method with M — 10 
and As = 1.0. Comparing this lattice with the uniform lattice with same As, we sim- 
ply replaced the equidistant sampling points on [—10, 10] by a set of non-equidistant 
point, whose adjacent spacing linearly decreases toward the jump point t — 0. As 
mentioned in the introduction, the maximum value of the new approximation is re- 
duced to 1.0193 with an error of 0.001. This gives about 70 — 71% of reduction in 
Gibbs' overshoot. 
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It is important to point out that, using the method of replacing the equidistant 
points in a finite interval on a uniform lattice with a set of non-equidistant points, the 
reconstruction kernel can be given in a closed form: using the same trigonometric 
identity (6) used for recovering the special case of Shannon, we can simplify the 
infinite sum in (5) into a number subtracting finitely many equidistant points and 
adding finitely many non-equidistant points. Hence the truncation error only arises 
from the reconstruction formula in (1), but not from the reconstruction kernel in (5). 

5 Outlook 

Using a non-equidistant lattice with the sampling density linearly increasing toward 
the jump point, we have seen a significant 70% reduction of the Gibbs' overshoot. 
Naturally, the question arises whether one can do better with a differently spaced 
non-equidistant sampling lattice. Is the linear change in sampling density the optimal 
lattice spacing to match the behavior of a step function? We will address these 
questions with a more detailed analytical study of the reconstruction kernel. 

The best result would be the elimination of all the overshoot. This may not be 
achievable since the generalized sampling theory considered here assumes that there 
is no accumulation point of the eigenvalues of these self-adjoint extensions. This 
corresponds to the assumption of a finite minimum distance in any sampling lattice. 
However, a further study of the theory of self-adjoint extensions with accumulated 
eigenvalues may solve the problem. 

For practical purposes, the ultimate goal is for any given random function to have 
an algorithm to determine an optimal sampling lattice, so that the function is (within 
a pre-specified error) in the function space specified by the lattice. 
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Abstract 

In this paper, wc present a method to solve the delay differential equations. First, 
we convert the delay differential equations to power series, then we transform the power 
series into Pade series form, which gives an arbitrary order for solving delay differential 
equations. The advantages of using of the proposed method are presented. In the 
sequel, presented numerical solutions of some experiments and comparing with several 
other methods illustrate the high accuracy and efficiency of the proposed method. 
Also, the experiments show that the method works well for non-linear and problems 
on large intervals, too. 

Keywords: Delay differential equations; Arbitrary order; Power series; Pade series. 
2000 Mathematics Subject Classification: 65N; 65L10; 65N55. 

1 Introduction 

Delay differential equations (DDEs), arise in many areas of various mathematical 
modeling. For instance, infectious diseases, population dynamics, physiological and phar- 
maceutical kinetics and chemical kinetics, the navigational control of ships and aircraft 
and control problems. There are many books on the application of DDEs which we can 
point out to the books of Driver [5], Gopalsamy [8], Halanay [9] and Kuang [10]. Many 
different methods have been presented for numerical solution of DDEs such that we can 
point out to the Bellman's method of steps [3], waveform relaxation method [6], Runge- 
Kutta method and continuous Runge-Kutta method ([1] and [2]), spline methods ([11] 
and [12]) and Adomian decomposition method [13]. 
Out of these methods, we desire to solve the following DDE by the proposed method. 

y'(t) = f(t,y(t),y(t-T 1 (t,y(t))),...,y(t-r m (t,y(t)))), t > t , (1) 

y{t)=cp{t), t<t , 
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where / : [to,oo) x M m+1 — ► M is a smooth function and Ti(t,y(t)),i = 1,2, ... ,m, are 
continuous delay functions on [to, oo) x R. Also <f>(t) represents the initial function or the 
initial data. 

The proposed method is easy with high accuracy and efficiency Also, this method 
produces an approximate polynomial on the interval only in a few terms. For instance, 
in order to discuss about the efficiency of the proposed method we sate its advantage 
with respect to Runge-Kutta method which is the most important method for numerical 
solution of DDEs. Runge-Kutta method like as all discrete variable methods produce 
approximations y n to y(t n ) on a mesh t n in the interval and starting with the given initial 
value, yo = y(to) at *o and stepping from y n « y(t n ) a distance of h n to y n +\ ~ y(t n +i) 
at t n +i = t n + h n . The step size h n is chosen as small as necessary to get an accurate 
approximation. It is chosen as big as possible so as to reach the end of the interval in 
as steps as possible. In the original form, Runge-Kutta method produces answers only 
at mesh points, but our method produces an approximate polynomial on the desired 
intervals which gives the desired accuracy in a few terms, only. Thus the computations 
will be decreased and the main difference between this method and Runge-Kutta method 
is observable. 

The organization of this paper is as follows: Section 2 is devoted to introduce the pro- 
posed method and then we transform the approximate polynomial gained by the proposed 
method to Pade approximate series. In section 3, we consider the error estimation of the 
method and In section 4, we present some experiments in-which their numerical results 
illustrate the high accuracy and efficiency of the method for non-linear and problems on 
large intervals. Finally, the last Section, consider some conclusions. 



2 The proposed method 

For solving equation (1), without loss of generality we assume that to = (every 
interval [to, oo) can be mapped to the [0, oo), easily). 
First suppose that {t — Tj(t, y(t))}^ =0 can be expanded as 

s j 
t-T j (t,y(t)) = Y,Pk j (t)y k (t), j = 0,l,...,m, (2) 

fc=0 

where Ttj{t) are polynomial functions. Also, suppose that fit, yit),yit—T\(t, y{t))), . . . , y(t- 
T m (t, y(t)))) can be expanded as 

f(t, y(t),y(t - n(t, y(t))), ...,y(t- r m (t, y(t)))) = 

no n\ n m 

E E • • • E &o,ii,.,i m (t)v io (t)v il (t - n{t, y{t))) ...y^(t- r m (t, y(t))), (3) 

io=Oh=0 i m =0 

where /3z ,u,...,i m (£) are polynomial functions. Now, we assumed that yo = y(0) = </>(0) and 
the approximate solution is 

yi(t) = y + et, (4) 
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where e is a coefficient which obtain as follows: 

We substitute (4) in (1) and by implementing (2) and (3) we gain a polynomial to approx- 
imate the expression 

f(t, y + et, y + e(t - n(t, y + et)), ...,y + e(t- r m (t, y + et))). (5) 

Suppose the approximate polynomial is in the form of 

/o + fit + f 2 t 2 + ... + f k t k + ... + f M t M , (6) 

where M is the maximum power of t in computations. Also, y'it) = e. Thus, we obtain 
an approximation of equation (1) as 

e = /o + fit + H 2 + • • • + fkt k + • • • + f M t M , (7) 

neglecting of higher order terms, we have 

e - /o + 0(t) = 0, (8) 

therefore, we obtain e = /o- Let us suppose that e = y±. Thus, we have the following 
approximation of order one, 

yi(t)=yo + yit- (9) 

In next step, we assume that new approximate solution is 

m(t) = y + yit + et 2 . (10) 

In the same way and neglecting of higher order terms (here 0(t 2 )), the value of e will be 
obtained. Repeating the above procedure for aforesaid term and higher terms, we can get 
the arbitrary order power series of the solutions for equation (1) as 

Vn(t) = vo + yit + y 2 t 2 + • • • + y n t n . (li) 

Also, the power series given by the above procedure can be transformed into Pade series, 
easily. 

Generally, Suppose that we are given a power series Xl£o a *^' representing a function 
f(t), so that 

oo 

/(*) = X>*\ (12) 

i=0 
A Pade approximate is a rational fraction 

[L/M] = Po+ P it + ...+ PL t L 

L ' J qo + qlt + ... + qMt M, V ' 

which has a Maclaurin expansion which agrees with (12) as far as possible. Notice that in 
(13) there are L+ 1 numerator coefficients and M + 1 denominator coefficients. There is a 
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more or less irrelevant common factor between them, and for definiteness we take qo = 1. 
This choice turns out to be an essential part of the precise definition and (13) is our 
conventional notation with this choice for qo . So there are L + 1 independent numerator 
coefficients and M independent denominator coefficients, making L + M + 1 unknown 
coefficients in all. This number suggests that normally the [L/M] ought to fit the power 
series (12) through the orders 1, t, t 2 , . . . , t L+M in the notation of formal power series, 

Ya t ?= P0+Plt + --- +PLtL M + O(t L + M ^). (14) 

^ q + qit+... + q M t M V ' V ' 

Multiply the both side of (14) by the denominator of right side in (14) and compare the 
coefficients of both sides in (14). We have 

M 

ai + ^a^kqk = p h l = 0,l,...,M, (15) 

fc=i 

L 

ai + Y^ai-kQk = 0, l = M + l,...,M + L. (16) 

fc=i 

Solving the linear equation in (16), we have qk,k = 1, . . . , L; and substituting q^ into (15), 
we obtain pi for all I = 0, . . . ,M, such as [4]. Therefore, we have constructed a [L/M] 
Pade approximation, which agrees with Y^i^o a *^ through order t + . If M < L < M + 2, 
where M and L are the degree of numerator and denominator in Pade series, respectively, 
then Pade series gives an A-stable formula for an ordinary differential equation [14] . 



3 Error estimation 

In this section, an error estimation for the approximate solution of (1) is obtained. Let 
us call e n (t) = \y(t) — y n (t)\ as the error function of the approximate solution y(t) to y(t), 
where y(t) is the exact solution of (1). Hence, y(t) satisfies the following problem: 

y'n(t) = f(t, yn(t),y n (t - n(t, y n (t))), . . . , y n (t - r m (t, y n (t)))) + K n (t), t > t . (17) 

The perturbation term K n {t) can be obtained by substituting the computed solution y n {t) 
into the equation 

K n (t) = y' n (t) - f(t,y n (t),y n (t - n(t,y n (t))), . . . ,y n (t - T m (t,y„(t)))), t>t . (18) 

We proceed to find an approximation e n {t) to the error function e n (t) in the same way as 
we did before for the solution of problem (2). Subtracting (18) from (1), the error function 
e n {t) satisfies the problem. 

e n (t) - f(t, e n (t),e(t - n(i, e n (t))), ...,e{t- r m (t, e n (t)))) = -K n (t), t > t . (19) 

It should be noted that in order to construct the approximate e n {t) to e„(t), only the 
equation (19) needs to be recomputed in the same way as we did before for the solution 
of problem (1). 
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4 Numerical experiments 

In this section, we present four experiments in-which their numerical results state the 
high accuracy and efficiency of the proposed method. Problems 4.1 and 4.2 are chosen so 
simple, but Problems 4.3 and 4.4 are nonlinear and almost complicated. In these problems 
we observe the high accuracy and efficiency of the method, obviously. 



Problem 4.1 Consider the following DDE [11, 12, 13], 

{ y'{t) = \e t 2y^) + \y{t), < t < 1, 

\ y(0) = 1. 

The exact solution is y{t) = e t . 

We have obtained 17 terms of the approximate solution. Comparing the results with other 
methods [11, 12, 13], we found that Pade approximate solution is the best as shown in 
Table 1. 



Table 1: Comparison of errors between some methods and Pade approximate solution 



u 


h = 0.001 [12] 


h = 0.001 [11] 


ADM [13] 


Pade approximate 
solution 


0.1 


8.23 x 10~ 12 


1.12 x 10~ 15 


1.30 x 10~ 15 


0.00 


0.2 


1.37 x lO" 11 


3.10 x 10~ 15 


1.40 x 10~ 15 


1.00 x 10~ 15 


0.3 


2.31 x lO" 11 


4.81 x 10~ 15 


1.50 x 10~ 15 


1.00 x 10~ 15 


0.4 


3.27 x 10~ n 


7.54 x 10~ 15 


1.90 x 10~ 15 


1.00 x 10~ 15 


0.5 


4.48 x 10~ n 


9.73 x 10~ 15 


2.10 x 10~ 15 


0.00 


0.6 


5.86 x lO" 11 


1.39 x 10~ 14 


2.20 x 10~ 15 


0.00 


0.7 


7.43 x lO" 11 


1.76 x 10~ 14 


2.40 x 10~ 15 


1.00 x 10~ 15 


0.8 


9.54 x 10~ n 


2.13 x 10" 14 


2.60 x 10" 15 


0.00 


0.9 


9.87 x 10- 11 


2.84 x 10~ 14 


2.20 x 10~ 15 


2.00 x 10~ 15 


1.0 


1.43 x 10~ 10 


3.19 x 10" 14 


2.60 x 10~ 15 


2.00 x 10~ 15 



Also, during the running of program we find out the run time of Pade approximation is 
0.451 second and run time of the Adomian decomposition method is 12.047 seconds. Thus 
in the case of numerical solution of DDEs, we prefer Pade approximation to the Adomian 
decomposition method. 

Problem 4.2 Consider the following DDE [13], 

y >(t) = l-2y 2 (±), 0<t<l, 
y(0) = 0. 

The exact solution is y{t) = sin{t). 

The approximate solution is obtained with n = 16 is given in Table 2. 
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Table 2: Exact solution, Pade approximate solution and the error value of y(t) 



y(U 



y(U 



\y(tj) - y(U 
o 







1 x 1(T 16 

i x i(r 16 
5 x i(r 16 

2.2 x 1(T 15 



0.0 1 

0.1 0.0998334166468281 

0.2 0.1986693307950612 

0.3 0.2955202066613396 

0.4 0.3894183423086505 

0.5 0.4794255386042030 

0.6 0.5646424733950354 

0.7 0.6442176872376911 

0.8 0.7173560908995228 

0.9 0.7833269096274834 

1.0 0.8414709848078965 



1.0000000000000000 
0.0998334166468281 
0.1986693307950612 
0.2955202066613396 
0.3894183423086505 
0.4794255386042030 
0.5646424733950354 
0.6442176872376910 
0.7173560908995227 
0.7833269096274829 
0.8414709848078937 



In the two previous experiments you observed that, this method of solution produces an 
approximate solution in minimum number of terms and acts accurate. This exhibits one 
of the main advantages of the method in spite of its simplicity. 



Problem 4.3 Consider the following DDE, 

y'{t) + sin{t)y 2 {t) + e t y(^-)y{t - sin{t)) = 2tsin(t) + t 2 cos{t) + 



t ( t 2 sinjt) ^2 



+sin(t)(t 2 sin(t)) 2 + e*( — T 
y(t) = t 2 sin(t), t < 0. 



sin i 



t 2 sin(t) 



)(t — sin{t)) 2 sin{t — sin{t)) , t > 0, 



The considered delays are 



n{t, y {t)) = t 



y(t) 



,T 2 {t,y(t)) = sin(t). 



The exact solution is y{i) = t 2 sin(t). The approximate solution is obtained with n = 52, 
and the results in the interval [0, 10] are given in Table 3. 
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Table 3: Exact solution, Pade approximate solution and the error value of y(i) 



ti 


v(U) 


v(U) 


\y(U) - 


-y(U)\ 








0.000000000000000 









.5 


0.119856384651050 


0.119856384651050 









1 


0.841470984807896 


0.841470984807896 









1.5 


2.244363719859122 


2.244363719859122 









2 


3.637189707302727 


3.637189707302727 









2.5 


3.740450900649728 


3.740450900649728 









3 


1.270080072538805 


1.270080072538805 









3.5 


-4.297094539197843 


-4.297094539197843 









4 


-12.10883992492685 


-12.10883992492685 









4.5 


-19.79498488271822 


-19.79498488271822 









5 


-23.97310686657846 


-23.97310686657846 









5.5 


-21.34259484850435 


-21.34259484850436 


1 X 


10" 


-15 


6 


-10.05895793516133 


-10.05895793516133 









6.5 


9.088819496710205 


9.088819496710206 


1 X 


10" 


-15 


7 


32.19234333722067 


32.19234333722067 









7.5 


52.76249869357906 


52.76249869357906 









8 


63.31892778389644 


63.31892778389643 


1 X 


10" 


-15 


8.5 


57.69069388704717 


57.69069388704717 









9 


33.38159730458228 


33.38159730458228 









9.5 


-6.782388621678290 


-6.782388621678286 


4 x 


10" 


-15 


10 


-54.40211108893698 


-54.40211108893692 


6 x 


10" 


-15 



In this experiment, the efficiency of the method on large intervals is observable. 



Problem 4.4 Consider the following DDE, 



y'(t) + y 3 (y(t)) + y(y 3 (t)) = 2te~ 2t - 2t 2 e - 2t + ((^ e -2* ) 2 e -2(^e-) ) 3 + 

+((t 2 e- 2i ) 3 ) 2 e- 2 (' 2e " 2t ) 3 , t>0, 
y (t) = t 2 e- 2t , t < 0. 



The considered delays are 

n(t,y(t)) = y(t),T 2 (t,y(t)) = y 3 (t). 

The exact solution is y(t) = t 2 e~ 2t . 

The approximate solution is obtained with n = 40, and the results in the interval [0, e] are 

given in Table 4. 
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Table 4: Exact solution, Pade approximate solution and the error value of y(t) 



y(U 



y(U 



\y(U) - y(tj)\ 



o o 

.le 0.04290244113814 

.2e 0.09964030214245 

.3e 0.13016990689938 

.4e 0.13436343801038 

.5e 0.12189732467981 

.6e 0.10191772165232 

.7e 0.08054458091031 

.8e 0.06108200165303 

.9e 0.04488604552887 

e 0.03217506012167 



0.00000000000000 
0.04290244113814 
0.09964030214245 
0.13016990689938 
0.13436343801038 
0.12189732467982 
0.10191772165233 
0.08054458091034 
0.06108200165308 
0.04488604552886 
0.03217506012167 













1 x 10- 14 

1 x 10~ 14 

3 x 10- 14 

5 x 10~ 14 

1 x 10" 14 





5 Conclusions 

We present a method for solving delay differential equations. This method is easy to 
implement and yields desired accuracy in minimum number of terms, such that numer- 
ical results demonstrate this issue. Comparing this method with several other methods 
shows that the method is more efficient and accurate. We also observed that the method 
works excellently for nonlinear delay differential equations. Thus the proposed method is 
suggested as an efficient method for the numerical solution of delay differential equations. 
The computations associated with the experiments discussed above, were performed in 
Maple 10 on PC, CPU 2.4 GHz. 
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Abstract 

In this paper, a general theorem concerning cp — \r\ 
factors of infinite series has been proved 



1. Introduction 

Let \(p n ) be a sequence of positive real numbers, let ^a n be an infinite series with 

the sequence of partial sums \s n ) . Let {u n ) denote the n-th (C, 1) means of the seque- 
nce (na n ). The series ^a„ is said to be summable |C,l| , k > 1, if (see[l]) 

(i-D Z-kl 



< oo. 



and it is said to be summable <p - |C,l| , k > 1 , if (see [5]) 



(1-2) I 



.k-i 



<Pn L. I* 



k 

n =\ n 



U,\ < 00 . 



If we are taking cp n =n, <p - |C,l| reduces to C,l summability . 
Let \p n ) be a sequence of positive numbers such that 

P n = Y!n=oPn "> °° ttS H ~> °° iP-1 = P l = ) ' 

The sequence-to-sequence transformation 

1 " 
(!-3) v n = —Y,P v s v 

defines the sequence (y n ) of the Riesz mean or simply the \N ,p n ) mean of the sequence 
\s n ) generated by the sequence of coefficients (p n ) (see[2]) . The series 
^a n is sais to be summable |/?, p n | , k > 1 if 

CO 

(i.4) 5yi v „- v -ir< 



oo. 



In the special case when p n = 1 for all n, then I/?, p\ summability is the same as 
|C,l| summability . The series ^a n is summable (p — \R, p n \ , k>\, if 

CO 

Xk— 1 I 1^ 

V -V , <oo. 

n=\ 
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For <p n =n, <p-\R,p\ summability is the same as |-K,p„|. summability . 

For arbitrary lower triangular matrix T = (t nv ), the series Vfl n is summable |7J , k > 1, 
if 

oo 

(i.5) z» w K-if <«>• 

where 

n 

'■n = 2-1 nk S k ■ 
k=0 

The series V a is summable m — 71 , k > 1 , if 

00 

d-6) x^'K-ir <a) - 

«=i 

Two matrices T = (t nv ) and T = [i m , ) can be associated with T as follows : 
The entries t nv and t m are defined by 

n 

(1.7) t , = Vf , i = t -t , . 

v / w/; / i bv ' wv n,v n-l,v 

v=k 

By above, we have 



n n n n n 



( L8 ) ? „ = Yj l nk S k = Z^Z a v = IX IX' = Yjnv a v . 

« «-l » 

(1.9) F := t -t , =Vf a -TV , a =Vf a , as t , =0. 



v=0 v=0 i'=0 



Concrning C,l summability, Mazhar [3] has proved the following 

Theorem 1.1. If 

(1.10) A, m =o(l), as m-> oo, 

m 

(1.11) ^n\ogn\A 2 A I = 0(1), as m -> go 

n=\ 
m \f 

(1.12) V^-L = O(logm) as m — > oo , 

v=l v 

then the series y\ci n A n is summable \C,l\ , &>1. 

Ozarslan [4], on the other hand , generalized the previous result by giving the following 

Theorem 1.2. Let \<p n ) be a sequence of positive real numbers and the conditions 
(l.lO)-(l.ll) of Theorem (1.10) are satisfied . If 

(1.13) £ tl'vl* =0(logm) a^m^oo, 
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„*-l 



k 

V v J 



n=v il 

then the series 2^a n A n is summable #>-|C,l|,, k>\. 

It should be mentioned that on taking <p n =n in Theorem (1.2), we get Theorem 1.1 



2. Main result 

Theorem 2.1. Let ((p n ),(Z n ) be sequences of positive real numbers such that (Z n ) is 
non-decreasing and the condition (1.10), is satisfied. If 
(2.D K\ = 0(pJP n ), 

(2-2) hAZ„=0(Z„), 

(2.3) J>Z„|A 2 A„|<a), 

(2A) ZKv||d| = 0^|), 

(2.5) f<P k - l \t H? \ = o((v\t I)*" 1 ), 

1 / / t t n \nn\ \ nv \ \\ | w\/ / ' 

(2-6) ZK v | = 0fcJ, 

(2.7) i^nq>q = o(/-%,n, 

n=v 

(2-8) i^% w ,f =0(1). 

TTzen f/ze series / aZ„Z„ is summable g> — \T\ j whenever / a„ is summable 

^^^J ft ft \ \ K m^^m4 

\R,Pnl,k>\. 



3. Lemma 

The following Lemma is needed 
Lemma 3.1. The conditions (1.10) and (2.4) implies 

oo 

(3-1) IX K I =0(!)' 

n=\ 

00 
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(3.3) 
(3.4) 
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nZ n |AA B | = 0(1), as n—> qo, 

Z„UJ = 0(1), as n-^ co. 



Proof. By virtue of (1.10), 

00 00 



00 CO 



n=\ 



n=\ v=n 



v=l 




0(1) 


CO 

I 


vZ v A 2 /l v 



i'=l 



n=l 



= 0(1). 

CO 00 00 

yu iiaz j =yiAz j y a|i i 

/ j I n 1 1 « I / j I m I / j I i' I 

n-1 y=n 

00 OO 

<S|AZ„|XK| 

«=1 v=n 

oo i> 

= 0(1)XK,EI AZ «I 

v=l n=\ 

CO V 

= o(i)XKIZ(z„ +1 -z„) 

v=l n=l 

00 

= 0(i)XlH|(2„-z,) 

v=l 

00 

= 0(1)XK|Z V 
= 0(1). 



rZ. \AA\ = nZ„ 



5>|AA V 



</zZ„£|a|AA v || 

00 

<nZ„X|AX| 

oo 

= 0(1)£vZ v |a 2 ;I v 
= 0(1). 



z„UJ = z„ YaIaI 

« | n | n / j | v | 

CO 

<Z„X|AA V ,| 
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0(l)£z v |A^ v | 

v=n 

O (1) , by the first part . 



4. Proof of Theorem 2.1 



Let (t n ) denote the \N,p n ) mean of the series ^a n . By definition, we have 



X n '■- K ~ t n-\ ~ 



Pn 



P P 

1 n 1 n-l v=l 



2>. 



Also, we write 



n v 



T = > t ? aAZ =? a AZ ? t = > t a A Z 

n / i nv / i ill / i ill / i nv / , til i i i 



v=Q 1=0 



/=() 



Therefore, we have 



Y :=T -T ,=Yi a AZ . 

n n n-l / < nv v v v 



v=() 



By Abel's transformation, 



r„=E p v-i^ 



t A Z 

nv v v 



i'=l 
n-l f 



= 1 Z P r-^ \ 

v=l V r=\ J 

n-l p p 
v=l P v 



t AZ 

nv v v 

P 

V r v-l J 



E^i< 



Vv=l 



r 2 z 

nn n n 



j K \ 



(i =t ) 

\ rcn nn ) 



f t AZ 

nv v v 

P 

V ^v-1 J 



PP. t AZ 

i « n-l y nn n n 



Pn 



n-l 



= Z 



XiU. +^XAfi„Z„ +— X.L.mAIZ. +- tL JfX.., 1 l. l1 AZ.. 



r=l 



V «V V V 



" y " nv"v v ' "^ v «,v+l v v 

Pv Pv Pv 



n,v+\ v+1 



+ ^X f /I Z 



' n nn n n 



- v +Y +Y +Y +Y 

1 n\ ^ 1 n2 ^ 1 ni ^ 1 nA Ti n5 • 



In order to prove the theorem, by Minkowski's inequality, it is sufficient to show that 

oo 

I^-'W^oo, r = 1,2,3,4,5. 
«=i 
We now applying Holder' s inequality 



in in 

^j t n | n\ | ^j r « 



«=1 



n-l 

Yxi AZ 

/ j v nv v v 



r=l 
m «-l 



= oa)I^ZI x v 



f ( n-l 

\k \ nv\ \ . \k\ rJ \k 

\A,\ \z 



I Jfc — 1 I v | | v| 



n=\ 



=1 



V 1 I \\" 

/ \t \\t 
/ j | w | nv 

Vv=l 



y-i 

J 
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= o(D2>ng* - l z\x 



k—\ ^~~^ | vr \k | nv | \ ~ \k \ ry |fc 

A. Z. 



I Jfc— 1 I "| | v| 



H = l 



1=1 



= o(i)Xl^v| ]-^itW l z vl I>« M f 

m 

= o(i)£v t - 1 |x v |* 



i=i 



0(1). 



z^r%r=z^ w 



n-l p 

/ i i' v nv v v 

,=i P v 



in n-l T)k 

= o(i)y/- i y%|z \%t 

„=1 v =l P v 

= oa)Z^>J Z%W 

«=1 V=l Pv 



( n-l 



,k-\ 



Ulzt 



V V 



y\At 

/ > | v nv 

Vv=i y 



AX 



A. Z. 



= (i)y^-ix rurz*y©*- i |f r af 

V / / ; & I v I I v I i' / j rn \nn\ \ v n 

v=l P v n=v 



= 0(V)Y^ Y \X v \ k v k -% 

v=l P v 

m 

v=l 

= 0(1). 



ijfc— 1 



Etf%r = z>r 



v=l Pv 



n-l r>k 



( 'n-l 



,*-l 



= o(d E^E^lM K|z* ik|z v 

v=l Pv Vv=l 



n=l 



= oa)Z^rKl z vZ^T-^r 

v=l Pv «=v 

m 

= 0(l)Yv k - 1 \X J*.v|AA JZ„ 

v=l 

m-1 f V \ v ( J2Lm 1 

= oa)SS rW W A (vK|z ¥ )+oa) ly-^i* m|A,yz B 

v-z-l V r=l / V ^=1 / 

m-1 m-1 

= 0(1) XKI^v + 0(1) £(v + l)|AZ v ||A/t v | 

V>=1 l'=l 

m-1 

+ 0(l)X(v + l)Z v+1 |A 2 /ll + 0(l) 



v=l 
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= 0(1) + 0(1) X Z I A A,| + 0(1) J>z v |aU v | + o(i) 

v=l v=l 

= 0(1). 



= o(D i>r 



Z — X v f n , v+1 A V+1 AZ, 

,=1 Pv 






m n-1 p^ , f n—\ \ 

= 0(1) V ffl H t% rif J U I |AZ I YU ||AZ I 

v ' ' ' 'n / ; k\ v\ I «,v+l I I v| I v\ / > I v|| v I 

y=l P v \v=l J 



n = \ 



n-\ 






0(l)£v"|X v |*|A v ,|Z v 

m 

o(i)£v M |x v f 



v=l 



0(1). 



^ \ Y n5\ =l u ( Pn 



n=\ 



— X t /I Z 

n nn n n 

Pn 



oQi^r^wiuw z * 

n=l P„ 

m 

:0(i)x^"Kr 

«=1 
0(1). 



5. Applications 

Corollary 5.1. Lef (^ n ) , (Z n ) be sequences of positive real numbers such that\Z n )is 
non-decreasing and let the conditions (1.10), (2.2) and (2.3) are satisfied. If 



(5.1) 
(5.2) 
(5.3) 



V 1 Yn Pn _ q 



pkp 

n=v * n n-1 



k-1 



r \ 



k-l\ 



V 



v-1 



y p v j 



j 



f fTV = o 



' m 'p.^ 



P P 

n=v l n n-\ 



V 



(Pn 



Pn 



V 

\ k 



P v l p ,v 



P P 

V n n-1 J 



o(i/p/). 
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Then the series ^a n A n Z n is summable (p — \R,p n \ whenever ^a n is summable 



Proof . Follows from theorem 2.1 by putting 



t =^. t = z&±. a? =- p - p * 



p nv p p nv p p 

r n r n r n-\ I n 1 n-\ 

Corollary 5.2. Let \ff> n ), \Z n ) be sequences of positive real numbers such that \Z n )is 
non-decreasing and the conditions (1.10), (2.2) and (2.3) are satisfied . If 

(5-4) i^ = 0(l/v), 

n=v '*' 

then the series ^a n X n Z n is summable ^> — |C,l| , .whenever ^a n is summable \C,l\ t , 

k>l. 

Proof. Folows from Corollary 5.1, by putting p n = 1 for all n . 
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Integral Inequalities Concerning Triple Integrals 



W. T. Sulaiman 

Department of Computer Engineering, College of Engineering, 

University of Mosul ,Iraq 



Abstract. New integral inequality concerning triple 
integrals is presented. 



1. Introduction 



The following open question was proposed in [2] 
Under what conditions does the inequality 

i i 

(1.1) \f a+f} (x)dx>\x p f a (x)dx 



hold for a and /3 ? 

In[l], the authors gave an answer by establishing the following 

Theorem . If the function f satisfies 

i 1 - 2 

(1.2) \f(t)dt>—-, Vxg[0,1], 

o 2 

then 

i i 

\f a+p (x)dx > JV f a (x)dx 



for every real a > 1 and f3 > 0. 

The aim of this paper is that to give a new theorem concerning a similar result but for triple 
integrals and over the domain [a,b] x [a,b] x[a,b]. 

2. New inequality 

We state and prove the following 
Theorem 2.1. Let f(x,y) , g(x) , h(y), k(z) be continuous functions defined on [a,b] x [a,b] x 
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[a,b],[a,b] , respectively, f is nonnegative, g{a) = h(a) =k(a) = 0, g'(x),h'(y),k'(z) > 1. 
Let a>\, j3,y,S>0. If 

b b b b b b 

(2.1) [[[ f(t,u,v)dtdudv > [[[ g(t)g'(t)h(u)h'(u)k(z)k'(z)dtdudv , 

z y x z y x 

V x,y,ze[a,b] 



= l -{g\b)-g\x)){h\b)-h\y))(k\b)-k\z)\ 

o 



then 



b b b 



b b b 



(2.2) HI f a+f,+r+s (x,y,z)dxdydz > HI f a (x,y,z)g f, (x)h'(y)k s (z)dxdydz, 



provided one of the following holds : 

(i) g(x),h(y),k(z) >1 V x,y,z e [a,b]. 

a(y+S) a{J3+S) a(P+y) 

(H) 8 



P+r+s (U\u P+r+s 



(b)h p+r+0 (b)k p+r+0 (b) > 



(a + J3 + 1) (a + y + 1) (a + S + 1) 



P 



' a + P + y + S^ 



fi + y + S 



+ 1 



r 



' a + fi+y+S^ 
P + y + S 



+ 1 



1 



a(y+S) 



(Hi) max 



(b) 



8 

h a(p+S) (b) 



a + J3 + S + 1 
a + /3 + 8 



+ 1 



J 

r a(/3+ r ) 



(b) 



h P+Y+S (b) k P+7+S (b) ' g P+7+S (b) k P+Y+S (b) ' g fi+7+S (b) h P+r+S (b) 

(a + /3 + y + S + \) 



< 



(a + ft + 1) (a + y + 1) (a + S + 1) 



Proof . By changing the order of the integration, we have 

b b b b b b 

J J J J J J f^' u ^S P \ x )S '(x)h r ~ l (y)h'(y)k s ~ l (z) k'(z) dt du dv dx dy dz 



a a a z y x 



( b b f b b 



= H H Hf{t,u,v)g p -\x)g\x)dtdx 

a z \a y \a x 

b b ( b b (b t 

= H H lf(t,u,v)dtlg p \x)g'(x)dx 



h r - l (y)h'(y)dudy 



k°-'(z)k'(z)dvdz 



a z \a y \a 
b b ( b b f b 



h r -\y)h\y)dudy 



k d -\z)k\z)dvdz 



V\\ ll\lf(t^v)g\t)dt 

i a 



\a y \a 



h r (y)h'{y)dudy 



k (z)k'(z)dvdz 



b b 



= -{{ lg p (t)dtHf(t,u,vW-\y)h\y)dudy 

* a z \a 



a v 



k d -\z)k'(z)dvdz 
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-. b b I b b u 

= -\\ jV (t)dt \f(t,u,v)du\h r -\y)h\y)dy 

i a z \a a a 



k d - l (z)k'(z)dvdz 



(b 



= — \\ \g p (t)dt\f(t,u,v)h r (u)du 

ii a z \a a 



b b 



f b b 



-» V V I V V 

— JJ \\f{t,u,v)g p (tW{u)dtdu 

i f a z V a a 



k a - 1 (z)k'(z)dvdz 



k d -\z)k'(z)dvdz 



b b 



b b 



-t U V u u 

= — \\g fi {t)h r {u)dtdu\\f{t,u,v)k s -\z)k\z)dvdz 

' * a a 



a z 
b 



■* b b b v 

= — ^g fi (t)h r (u)dtdu^f(t,u,v)dv^k sl (z)k'(z)dz 

it a a a a 



b b 



= f f g p (t)h r (u)dtdu \f(t,u,v)k s (v)dv 

ByS J J J 



b b b 



i ODD 

= \\\f{t,u,v)g p {t)h r {u)k s {v)dtdudv. 

By 8 j j j 

* / a a a 



Also, 



b b b b b b 



J J J J J J f^'U'^S^ l ( x )g'( x )h 7 \y)h'(y)k s l (z)k'(z) dtdudxdydvdz 



a a a z y x 



b b I b b 

J J J \ f(t> u > v )dtdudv 



a a \y x 
b b b 



g p - l {x)g\x)h r -\y)h\y)k d -\z)k\z)dxdydz 



4 o o o 

>-\\\ {g\b)-g\x)){h\b)-h 1 (y)){k\b)-k\z))g f, -\x)gXxW-\y) 

a a a 

xh'(y)k s - 1 (z)k'(z)dxdydz 

, b b 

= -\{g\b)-g\x))g l3 -\x)gXx)dx\{h 2 {b)-h 2 {y)y-\y)h\y)dy 

a a 

b 

x^(k 2 (b)-k 2 (z))k s - 1 (z)k'(z)dz 



(g p+2 (b) g^ 2 (b)Yh r+2 (b) h^ 2 (b))(k s+2 (b) k s+2 (b)^ 



Therefore 



p p+\ ){ r r + 2 
g e +2 (b)h r+2 (b)k s+2 (b) 

~ pyS{p + 2){y + 2){S + 2) 

b b b 

\\\ f(t,u,v)g p (t)h r (u)k s (v)dtdudv 



5 8 + 2 



> 



g p+2 (b)h r+2 (b)k s+2 ib) 
(j3 + 2)(y + 2)(S + 2) 
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Now, by the AG inequality, 

-f a {x,y,z) + — g a {x)h a {y)k a (z) > f{x,y,z)g a - l {x)h a -\y)k a -\z). 
a a 

Multiplying the above inequality by a g ^ (x) h r (y) k s (z) , we obtain 

f a {x,y,z)g p {xW{y)k s {z) > a f(x,y,z) g a+f! - l {x)h a+ ^{y)k a+s -\z) 

-{a-\)g a+p {x)h a+r {u)k a+s (z) 
> a f{x,y,z)g a+p -\x)h a+ ^{y)k a+s -\z) 
-{a -1) g a+p (x) g\x)h a+r (u)h'(u)k a+s (z)k'(z) 



Triple integration implies 



b b b 



b b b 



\\\ f a+ ^ +s (x,y,z)dxdydz > a^j f(x,y,z) g a+p -\x)h a+r -\y)k a+s -\z)dxdydz 



b b b 



(«-l)||| g a+p {x)g\x)h a+y {y)h\y)k a+s {z)k\z)dxdydz 



g a+/1+1 (b)h a+r+1 (b)k a+s+1 (b) _ g a+/l+1 (b)h a+r+1 (b)k a+s+1 (b) 



(a + P + 1) (a + y + 1) (a + 8 + 1) 
g a+p+l (b)h a+7+x (b)k a+s+l (b) 
{a + p + 1) (a + y + 1) (a + 8 + 1) 



(a + p + 1) (a + y + 1) (a + 8 + 1) 



We now deal with the three cases separately 



Case 1. We claim that P 



a+P+y+8 
P + y + 8 



<a + P , y 



a+P+y+8 
P + y + 8 



<a + y, 



and 8 



a+P+y+8 
P + y + 8 



p<p + y + 8^l + 



< a + 8. In fact 

a a + P + y + 8 a + P 



a 



-<1 + 



-<- 



p+y+8 p p+y+8 p 



P 



a+P+y+8 
p+y+8 / 
<a + p. 



Therefore, we have 

f a (x,y,z)g p (x)hHy)k s (z) < 



a 



•a+{S+y+8 



a+P+y+8 



f a+p+r+0 {x,y,z) 



SULAIMAN: INTEGRAL INEQUALITIES 



589 



+ 



fi + y + S 
a+ p+y+8 



a+P+y+S 



(g p (x)h r (y)k s (z)) P+r+s 



(2.3) 



< 



a 



a+ fi + y + S 



R e Ja+P+r+S\ 

r +ft+ ' + '(x,y,z) + P+ J x g [ Pn+S } (x)g'(x) 

a+fi+y+S 



x h 



( a+P+y+5 \ ( a+fl+r+S '] 

' [ p+r+d \y)h\y)k [ p+r+s \z)k\z) 



a 



fi + y + S a , 



< -,f a+P+r+S {x,y,z) + H '' - s a+p (x)g'(x)h^(y)h'(y) 



a + fi + y + S' 



a + fi + y + S 



xk a+d (z)k'(z) 



Triple integrating both sides the above inequality from a to b gives 

b b b b b b 

a 



o fJJJ f a+p+r+s (x,y,z)dxdydz > \\\ f a (x,y,z)g p (x)h'(y)k s (z)dxdydz 

a+ B+y+8 i J J J J J 



fi + y + S 



b b b 



P J — \\\ g a+P {x)g\x)h a+ ^y)hXy)k a+5 {z)k\z)dxdydz 
+ fi+y+S J J J 



a + fi + y 



b b b 



> 



^ -AW f a (x,y,z)g f! (x)h'(y)k s (z)dxdydz 

a + fi + y + S J J J 



b b b 



+ 



P V + S A N {f a (x,y,z)g p (x)h 7 (y)k 5 (z)-g a+p (x)g'(x)h a+ Hy)h'(y)k a+s (z)k'(z)) 
a + fi + y + S J J J 

i i a a a 

x dxdydz 



b b b 



> 



IT AN f a (x,y,z)g p (xW(y)k s (z)dxdydz 

a + fi + y + S J J J 



a + fi + S 



( „ a +P+Uu\h a +r+Uu\ha+s+\ 



a + fi + y + S 



g a+p+l (b) h a+y+i (b)k a+d+l (b) g a+p+l (b) h a+y+i (b)k a+6+i (b) 



a +P+ l (U\h a +r+ l (U\h a + s+l (U\ \ 



(a + fi + l)(a + y + Y)(a + 5 + 1) (a + fi + 1) (a + y + l)(a + S + 1) 



b b b 



= — f f f f a (x,y,z)g p (x)h r (y)k 3 (z) dxdydz. 

a+ fi+y+S J J J 



Case 2. Since g(x)>0, g is increasing. Hence g(x)> g(a) = 0. Also h(u), k(v) >0.It is not 
difficult to show that (ii) implies 



a+p+r+s *\ n (a+p+r+s\ s ( a+ P +r+s )q 

P+r+S ] (b)h [ P+y+S > (b)k [ P+r+d ] (b) 



ft 



' a + fi + y + S^ 
/3 + y + 8 



+ 1 



r 



r a + fi + y + S^ 
/3 + y + 8 



+ 1 



S 



' a + fi + y + S^ 
/3 + y + S 



+ 1 
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< 



g a+li+ \b)h a+r+l {b)h a+5+l {b) 
{a + J3 + 1) (a + y + 1) (a + 8 + 1) 



Therefore, from (2.3) we have 



b b b 



a rrr f * + ^ {x ^ z)dx dydz 

+ B + y + S J J J 



a + P + y 



a a a 
b b b 



> 



—^ F \\\ f a i.x,y,z)g\xW{y)k\z)dxdy 

+ B + y + 5 JJJ 



a + P + y 



dz 



+ 



P + y + S 



( b b b 



§^f a (x,y,z)g fl (x)h r (y)k s (z)dxdydz 



l,l, h fl | a+P+y+S a+P+y+S J a+P+y+S 

p+r+s j , x i, s, v p+y+s L A u't^Mr v P+r+s 



b b b 



a + B + y + S\ J J J 
\\\ g V P+r+d , {x)g\x)h ' ^ +d ) {y)h'{y)k ' ^ +d \z)k'{z)dxdydz 

/3 + y + S 



> -JJJ f a (x,y,z)g fi (x)h^y)k s (z)dxdydz + -?—f- x 

+ B + y + 5 J J J a + B + Y + 



a + P + y + S J J J 

i f a a a 



a+P+y+S 



( g a+ P +l (b)h a+r+1 (b)k a+s+l (b) 
(a + p + 1) (a + y + 1) (a + 5 + 1) 



( a+P+y+S \ ( a+p+y+S \ ( a+p+y+S \ 

• { - P ' Y+S \b)h^ Pn+S \b)k^ P+7+S \b) 



P 



a+P+y+S 
p+y+S _ 



+ 1 



a+P+y+S 
P + y + S 





/ 


+1 


5 




V 



a+P+y+S 
p+y+S _ 



+ 1 



b b b 



> 



P + y + S 



-? -JJJ f a (x,y,z)g f} (xW(y)k s (z)dxdydz + P J 

a + B+v + S J J J a + B + v + 



P + y + S 



g a+p+ \b)h a+r+ \b)k a+5+ \b) g a+p+ \b)h a+r+ \b)k a+s+ \b) ^ 



(a + P + 1) (a + y + 1) (a + 5 + 1) (a + P + l)(a + y + l)(a + S + 1) 



b b b 



= ^ rJJJ f a (x,y,z)g fi (xW(y)k*(z)dxdydz. 

a + B + y + S JJ J 



P + y 
Case 3. We have, by (iii), 

f a {x,y,z)g p {xW{y)k s {z)< 



a 



a+P+y+S 



f a+ ^( X ,y, Z ) + 



P 



a+P+y+S 



a+P+y+S 



(X) 



SULAIMAN: INTEGRAL INEQUALITIES 591 



+ Y - h a+p+y+& \y) + k a+p+r+s (z) 

a+P+y+8 a+P+y+8 



< / / a+p+y+s (x,y,z)+ / g a+p+r+s (x) g \x) 

a+P+y+8 a+P+y+8 

7 l-a+p+r+S s,\ t// \ _i_ h a+/3+r+S . 



s h a+ ^ +6 (y)h'(y) + -^ -k a+p+ ^ d (z)k'(z), 

a+P+y+8 a+P+y+8 

which implies 

-T s\\\ f a+ " +r+S (x,y,z)dxdydz> -^ -JJJ f a {x,y,z)g p {xW{y) 

a + B+y+8 : J J a + B+y+8 i J J 

i * a a a * * a a a 

x k s (z)dxdydz 



b b b 



n ODD 

+ f- -\\\ (f a (x,y,z)g p (xW(y)k s (z)-g a+p+r+s (x)g'(x))dxdydz 

a + P + y + 8 JJJ 

i f a a a 

b b b 

+ / F fff {f a {x,y,z)g p {xW{y)k\z)-h a+ ^ s {y)hXy))dxdydz 

a + P + y + 8 J J J 

i f a a a 

o b b b 

+ rJJJ (f a (x,y,z)g p (x)h'(y)k 5 (z)-k a+ ^ +s (z)k'(z))dxdydz 

a+ B + y + 8 JJ J 



P + y 

a 
ply 



a a a 
b b b 



> tJJJ f a (x,y,z)g p (x)h r (y)k\z)dxdydz 

a + fi + y + 8 J J J 



P b c\ b c ( g a+p+l (b)h a+r+ \b)k a+s+ \b) g a+p+r+s+l (b)(b-a) 2 ^ 

a + P + y + s{{{ \ y (a + P + V)(a + y + l)(a + 8 + l) a + P + y + 8 + 1 

b b b f °'+A+l/'J,\ 7 a +Y + \ (U\ 7 a+<5+l /i x ; a+fl+y+S+l /t\ /i \2 A 



7 f f f g a+p+ \b)h a+r+i (b)k a+0+ \b) h a+p+r+0+ \b)(b-a) 



J 



| r rrr g (P)n kp)k ko) n KO)Ko-a 

a + P + y + s{{{ {(a + P + l)(a + y + l)(a + S + l) a + P + y + S + l 

8 \\\ ( g a+p+l (b)h a+r+l (b)k a+s+l (b) k a+p+r+s+l (b)(b- a 

a + P + y + s{{{ {(a + P + l)(a + y + l)(a + S + l) a + P + y + 8 + 1 

b b b 

> ^ -fff f a (x,y,z)g p (x)h r (y)k s (z)dxdydz 

a + fi+y+8 J J J 

i f a a a 



P rrr ( g a+p+ \b)h a+r+ \b)k a+s+ \b) g a+p+ \b)h a+n \b)k a+s+ \b) ^ 



p rrr g y°> n y°> K w s w« w* \o) 

a+P+y+8* J ■* l(a + ^ + l)(a + f + !)(« + £ + !) (a + /? + l)(« + f + l)(a + £ + l) 
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+ 



b b b 



,a+fi+l(l,\ h a+ y +X (h\ h a+S+1 



a+ P +x <h\u a+ r+Uu\h a+s+ UK\ \ 



+ 



r rrr g y°) n yp)* yp) § \°) n igJ* kp) 
a + j3 + y + s{{{ \(a + P + \)(a + y + \)(a + d + \) (a + P + \)(a + y + \){a + 5 + 

5 rrr ( g a+m (b)h a+r+1 (b)k a+S+1 (b) g a+l3+l (b)h a+r+i (b)k a+S+l (b) 

a + /3 + y + s{{{ \(a + P + \)(a + y + l)(a + S + l) (a + /3 + l)(a + y + \){a + 8 + 

b b b 

= -^ -fff f a (x,y,z)g fS (xW(y)k s (z)dxdydz. 

a + fi + y + S J J J 

i f a a a 



3. Applications 



Corollary 3.1. Suppose that the statement of Theorem 2.1 is satisfied. If 



b b b 



b b b 



(3.1) [[[ f(t,u,v)dtdudv > [[[ tuvdtdudv Vx,y,ze[0,b\, 

z y x z y x 

then 

b b b b b b 

(3.2) \\\ f a+p+7+d (x,y,z)dxdydz > \\\ f a (x,y,z)x p y 7 z s dxdydz , 



ooo 







provided 



b a > 





{a + f3 


+ 


\)(a + y + l)(a-\- 


-8 


+ D 




p 


f a \ 
K /3 + y + Sj 


+ 1 


r 


f a \ 
K /3 + y + Sj 


+ 1 


5 


f a > 
K /3 + y + Sj 


+ 1 



Proof. Follows from Theorem 2.1, using (ii) and putting a = 0, g(z) = h(z)=k(z) =z. 



Corollary 3.2. Suppose that the statement of Theorem 2.1 is satisfied . If 

b b b b b b 

(3.3) [[[ f(t,u,v)dtdudv > \\\ (t -a)(u-a)(v-a)dtdudv \f x,y,ze[a,b], 

z y x z y x 

then 

b b b b b b 

(3.4) j*]*]" f a+/}+r+s (x,y,z)dxdydz > \\\ f a {x,y,z){x-aY (y -af (z-a) s 'dxdydz, 

a a a a a a 

provided 



(b-a) a > 



{a + P + 1) (a + y + 1) (a + 8 + 1) 

(a + P + y + S + l) 



Proof. Follows from Theorem 2.1, by putting g(z) = h(z) = k(z) = z- a . 
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Abstract. Several new inequalities concerning 
improvement, generalization and new results of 
concave functions are obtained . 



1 . Introduction 

A real valued function /defined on an interval / <z 9? will be termed convex if 

f(ta + (l-t)b)<tf(a) + (l-t)f(b), a,bel, 0<t<l, 
or equivalently 



/ 



a + b 



< 



f(a) + f(b) 



If the above inequalities are reversed then /is said to be concave . 

In [1], Pearce and Pecaric established the following results concerning concave 
functions 

Theorem 1.1. If f : [a,b] — > 9? + is continuous and concave, then for all real 
numbers a,/3 with <a + f3 < 1, < /3 < 1, 



-a) J J a + B 



a+p 



(b-aYi 



a + b 

•a+/3 



(5 f a+p (a) + f a+ e(b) 



a + (3 



Theorem 1.2. If f : [a,b] — > 9? + is continuous and concave, then for all real 
numbers f3 with < f3 < 1, 



I b b 

77 \f fi (t)dt\r P Wt< \ + p\og 

(p-af[ { 



f((a + b)/2) 

yjf(a)f(b) 



2. Results 
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Lemma 2.1. Let a,b>0. Then 



(2.1) 



(2.2) 



a + b 



a + b 



\ a 



a , i a 



> 



a a +b 

a a +b c 



< a < 1, 



< 



a>\. 



Proof. Let 0<x<l, < a < 1 . Consider 



fix) 



x + l 



\a a 



X° +1 



We have 



fix) = 



a 






X 



a-1 



>0. 



Therefore / is non-decreasing. Since f(0) = — > 0, then, we have 

/(x)>/(0)>0. 

Set c = min \a,b) , C = max{a,b] , then (2.1) follows by putting x = c/C . The 
proof of the other part is similar . 



Remark 1. The above Lemma shows the following 
(i) / > 0, / concave =^> / a concave if < a < 1 . 
(ii) / > 0, / convex =^> / a convex if a>\. 



(i) follows as f a 



a + b 



> 



f(a) + f(b)) ^ f a (a) + f a (b) 



The proof of the other part is the same 



'a + b^ l •'--' 



a+b' 



Lemma 2.2. Let a,b > 0. then 

(2.3) 



ft r\ 

Proof. Since (b-a) 2 > => lab < a 2 +b 2 => — + — > 2, then, we have 

b a 

a b 



(a + b)(a- l +b- l ) = 2 + - + ->4 



b a 



which implies (2.3) . 

Remark 2. If f~ 1 (a) = f(a)~ l , then 
/ > 0, / concave =^> / ~ convex 
This follows from Lemma 2.2 : 



/ concave implies /" 



'a + b^ Jf(a) + f(b)\ l f-\a) + f-\b) 
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The following giving a refinement for the generalization of Hadamard's inequality. 



Theorem 2.2. Let f : [a,b] — > 9? be convex, and g : [a,b] — > 9? J be integrable 



and symmetric with respect to the lines ^t l , 2i r L . Then 



■A+b 



(2.4) f 



f a + b" h 



jg(x)dx< jf(x)g(x)dx< - 



/(«) + / 



a + b 



jg(x)dx 



1 

+ — 
2 



f f a + b^ 



f\Z±l +f(b)\\g(x)dx 



\ v z y 



2 



In particular for g(x) = 1, 



a + b 



\ ^ J 



^^—\mdx<y 

h — n •> } 



b-a 



f a + b^ 



\ ^ J 



f(a) + f(b)^f(a) + f(b) 



< 



If f is concave, (2.4), (2.5) reversed. 



f 



a + ■ 



■- x 



v 



J 



0<t<l 



Proof. It is sufficient to prove the middle inequality . As / convex implies 

' a + b \ J a + b ,. ^a + b^ 
f a + ta-(\-t) 

\ 2 2 

J ,-i n a+b^ 
= f (X-i)a + t—— 

V 2 



J 



<(\-t)f(a) + tf 

,■ . , a + b 
= /(«) + / 



J 
' ' a + b^ 



<f(a) + f 



v ^ y 



^(a) + (l-0/ 



a + Z? 



V 2 yy 



= /(«) + / 



v 2 j 
r a + b ^ 

V 2 , 



-/ to + (i-0— - 

v ^ y 



•/(*), 



then, we have 



A+b 
2 



jf(x)g(x)dx= J + | =/ l +/ 2 , 



2+i 

2 



a+b 

a+b 

2 



f a + b^ 



h = } f(x)g(x)dx< \\f(a) + f ?Lf- -f 



v ^ y 



a + ■ 



a + Z? 



#(.*:) d* 



yy 



a+b 

/(«) + / 



a+b 



V 



a + M) 2 r , , , z r J a + b ' 
— — J g(x)dx- J / a + ^~ -- 



g(x)<i.*; 
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a+b 

/(«) + / 



a + b 



a+b 
2 



which implies 



v 2 
a + b 



| gO)<&- | / « + 



a + b 



a + ■ 



a + b 



dx 



a+b 



a+b 



\ 



| g(x)^~ J/(x)g(x)^ ; 



/,<" 



1 






f(a) + f 



a +b 



jg(x)ch 



Similarly, 



h^ 



r a + b^ 



f °-y- + f{b)\\g{x)dx 



V v *■ J 



a+b 

~2~ 



The following is an improvement of Theorems 1.1 and 1.2 via simpler method 



Theorem 2.3. 7f / : [a,&] — » 91 + zs continuous and concave, then for all real 
numbers a, J3 with <a + P <l,0 < P <l, 

-a) J J a + /? 



(2.6; 



(^-< J fl 



v ^ y 

?a+fl 



p r\d)+r\b) 



a + P 4 

Proof. On the light of inequality (2.5) for concave functions 

1 b b , b b 

- Y \f a (t)dt[f fi (t)dt=- - Y \if a (x)f\t)dxdt 



(b-a) 2 {[{a + (3 J a + (3 J 

b-a : \a + /3 a + P 
=T^-\f a+P (t)dt 

h — n J 



dxdt 



dt 



b-a- 

a 

a + 2(3 1 
a + P b- 



P 1 



- f f a+p (t)dt £ — \f a+fi (t)dt 

a J a+ B b-a J 



< a + ip_ fa +p{a + b\ p 



a + P 



J 



a + P 



•a+P (U\\ 



I q+Ja + b^ f a+p {a) + f a+p {b) 
2 7 t 2 J 4 
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a + (3 \ l j 



P f a+p (a) + f a+p (b) 



a+P . 



a + (3 



Theorem 2.4. If f :[a,b] — > S J{ + is numbers jB with 0< j3 < 1, continuous and 
concave, then for all real 

(2.7) l*_l_J/>(0d>J/-'(0* <l + ^ln' /((a + Z7)/2) 



JfWfib) 



Proof. 



,1/2 



^6 



i = -^-)f p,2 (t)r m (t)dt<-^-[)f\t)dt I I j /->(o# 

b-a J b-a\ J I { 

Squaring the above implies the left inequality. 

The right inequality in(2.6) can be written in the form 



.1/2 



/ 



a+p 



f „ , U\ 



a +b 



1 — Y \f a (t)dt\f fl (t)dt<\ + (a + ^) ^ 

-a) J J ' a + 



(b-aYi 



2 _^ 
P 



I 
4 



( fa+P 



f a+p {a)-\ | f a+p (b)-\ 
a + /3 a + P 



J 



Now making use of the limit, lim = In a, the above inequality gives the right 

t-X) t 

inequality in (2.7) as a — > -p. 



The following result giving the reverse case 



Theorem 2.5. If f : [a,b] — > S J{ + zs continuous and concave, then for all real 
numbers P > 0, < a - /? < 1, 

(2.8) — L^f /«(,)#?/-*(,)# > i^zA f c-/ a+b ' 

(£-a) 2 J J «-/? 



V 2 , 

a-p 



a f a - p (a) + f a - p (b) 
a-p 4 



In particular 

(2.9) 

Proof. 



1 a /? 

- -^\f p {t)dt\f- p (t)dt>\ + p\n 



f f(a)f(b) A 
Jf{{a + b)ll) 



6 6 , 



(b-a) 2 [ J i J (b-a) 2 {{{a-p J a~p J 



dxdt 
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= 7^—J fa ^ t)dt 



ft 



a-p ] a-p ] 



>- 



a 



[y 



a-p 



r a + b^ 



a-pyi 



f a -v(a) + f a -»(b) 



<*-0fu\\ 



V ^ J 



4 
P 



<*-P .a-P 



a-p 



r a + b^ 



V * J 



a-p 

-a-p 



f 



J 

' a + b ^ 
v 2 J 



a f a - p (a) + f a -V(b) 



a-p 



Inequality (2.8) can be written as 



, b b 

rr J f a (t)dt\f - p (t)dt> 1 + (\a-p) 

Q>-a) J a J a 



By letting a — > P, the above implies 



/ 



a-p 



r a + b^ 



V * J 



a-p 



a 

+ — 
4 



f(a) + f a - /j (b)-l 



a- P a- P 



(b-a) 2 { J [ J " Jf((a + b)/2) 



Other kinds are also obtained 

Theorem 2.6. If f : [a,b] — > s Jl + is continuous and concave, then for all real 
numbers a > 0, /? > 0, a + fi = 1, 

(2.10) —l—\f«(t)dt\f- fi {t)dt < af{^ 



+ J3 



( \ * 



r 



a + b 



V ^ J 



-Wi\\ 



f-\a) + r\b) 



Proof. By virtue of Remark 2, 



b b 



-^\f a (t)dt\f- p (t)dt = -\\f(x)f- fi (t)dxdt 

(P-a) i i Q>-a) it 



< 



-a) 2 {{{a + P a + P 



(b-aYi- 



dxdt 



b b 



-\\{af(x) + Pf\t))dxdt 

(P-O) a a 
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1 h 

=- \{af{x)dx + /3f\t)dt) 

h — n J 



b — a 

LI 



( \ r 



f 



a + b 



V ^ J 



\tU\\ 



f-\a) + f-\b) 



Theorem 2.7. If f : [a,b] — » SR + is continuous and concave, then for all real 
numbers 
a,fi,s,t,p,q, p<l/a, q> 1/ J3,s <1/ a, t > -1/ J3, i + i=l,i + } = l,p > 1, 

< s < 1, 



(2.11) 



1 

2s 



f 



a + b\ x f a \a) + f as {b) \ 1 L, 



V v z J 



r a + b^ 



\ ^ J 



f- p '(a) + f- p, (b) 



* -77^1 f a (x)dx\f"(x)dx <-r 
(b-a) 2 { { p 



' a + b^ 



( f 



2q 



m a + b 

V v 2 j 



f-^(a) + f-*(b^ 



Proof. We have 



b b , 



—L^]f-(x)dx\f- p (u)du<— -i-j-JJ I/*(x) + i/-A(ii) 



dxdu 



1 1 



\\- f ap (x)dx + - f- pq (u)du 
b-a J \p q 



p V 2 ; 2g 



A a + fr 

V V 2 y 



f-»(a) + f-*(b)^ 



Also 



(fe-a) 



^j/" (x)dx\f p {u)du > _l_.fl/-(x) + I/-A( I< ) 



(&-a) v 

i n 



dxdu 



b-a 



i A 

f as (x)dx + -f - pt {u)du 
\s t 



( r 



>- 



2s 



+ - 



2? 



,,<« a + b 

\ ^ J 
R ,{ a + b 
v ^ - i 



f as (a) + f as (b) 



-P' 



r m (fi)+r"Q>) 



-P'(U\\ 



Theorem 2.8. If f : [a,b] -> 91 + w continuous and concave, then for all real 
numbers a >0,/3 > 0, a + (5 > 1, 



SULAIMAN: CONCAVE FUNCTIONS 



601 



, b b 

(2.i2) - — -\r a (x)dx\r fs (t)dt 



< 



1 a 
2a + j3 



f 



-a-p 



r a + b^ 



V z J 



a + 2^f-^(a) + f- a -"(b) 
a + p 4 



Proof. Let g - f . Then g is convex, and hence 



(b-aY 



l -—\g a {x)dx\g p {t)dt<—^—\-^g a+p {x)dx + -! 3 —g a+l3 {t)dt 
-a) 2i J (b-a)\a + B a + B 



a + 2p~p 1 



a + B b-a J 

" a 



\g a+P (t)dt 



< 



a + 2p 



a + jB 



1 



a+p\ 



P 



P(U\\ 



a+b) : g a+p {a) + g a+p (b) 



P 



a+P 



f a + b^ 



1 a 
2a + p 



a+p 



r a + b^ 



V * J 



a + p 

a+p . 



V ^ J 



a + 2/3g a ^(a) + g a+/} (b) 
a + p 4 



Therefore 



, b b 

-\f a (x)dx\f- p (t)dt 



<- 



1 a 
2a+fi 



f 



-a-p 



a + b 



V * J 



a + 2(3 f- a - p (a) + f- a -'\b) 
a + p 4 
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Abstract 

We derive a linear, nonhomogeneous, recurrence relation having two indices of recur- 
rence, with initial conditions and equated to a binomial coefficient. We relate a known 
classical combinatorial identity to partial sums of geometric series and show that a cer- 
tain corresponding sum always is a rational, non-integer term. We construct solutions 
which are rational expressions with an indeterminate form evaluated in a limit as a bino- 
mial coefficient. We establish several interesting combinatorial identities and use these 
to express sums of powers of integers (in particular, squares and cubes) as a finite sum 
of binomial coefficients with integer coefficients. 



Keywords: recurrence relation, binomial coefficient, characteristic polynomial. 



1. Introduction 

Different counting procedures and various arrangements of mathematical objects lend 
themselves to combinatorial identities. Moreover, combinatorial identities [11] arise in 
numerous settings in mathematics and commonly involve polynomials, binomial coeffi- 
cients and recurrence relations and the current paper combines all of these features. For 

example, if S p (n) = 1 + 2 P H \-n p , integers p > 0,n > 0, then it can be shown [9], (sect. 

4 contains result) that S p (n) satisfies a recurrence relation involving binomial coefficients, 
term l/(p+ 1) and Si(n), < i < n, So(n) = n. Additionally, computer-generated proofs 
[12] also generate identities as a by-product of the proof-process. 

In a sense, the present paper offers an algorithmic way of producing at the least a 
sequence of combinatorial identities whose importance or significance lies in originality. 
In one case an elegant result [10] (see sect. 2) was generalized. 

The basic idea lies in the characteristic polynomial of the j-th order Fibonacci sequence 
given by Fj(x) = x^ — x^ 1 — ... — x — 1. It is known that the positive zeros of Fj(x) are 
of the form 2 — 0(2~ J ) [1],[2]. It has also been shown [8] that the single negative zero of 
Fj(x) has the form — 1 + 0(j~ r ) for j even and tends to —1 monotonically as j — > oo. 
By factoring these zeros one gets, 

Fj(x) = (x — 2 + £j)(x + 1 — Sj)(x-'~ 2 + Oj_3 X"' -3 + • • • + a\ x + ao), 

where Sj and Ej are positive, decreasing sequences for j = 4, 6, . . .. In [7] an explicit 
form for the coefficients a^ was found by solving a non- homogeneous linear recurrence 
relation of the form — a n + ba n+ i + ca n+2 = 1 where b = e — 1 — 5, c = (1 — 5) (2 — e). As 
a byproduct of this solution, several combinatorial identities were formulated by solving 
by a standard method and comparing solutions; computer-generated and combinatorial 
proofs were also given to some identities, [4], [5], [6]. 

A recent paper, [13] has studied the asymptotics of the zeros of the positive and 
negative zeros of the derivatives and indefinite integrals of Fj as j — > oo and shown 
this behavior is monotone for each derivative and integral. Moreover, one can write for 
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sufficiently large j and by factoring that 



j-(k+2) 

;i.l) F^ = (x-2 + e j )(x + l-S j ) Yl 

i=0 



Here F- is the k th derivative of Fj. Analogous to the case k = which corresponds to 
Fj] in the present paper we derive recurrence relations of the form 

(n + k + 2 \ 
, k, n > 0, 
n + 2 / 

with initial conditions. The proof of this result is by induction over k and is given in 
section 2. In sect. 3 we discuss the special case k = 1 and c = b + 1 which we call a 
singular solution and leads to an indeterminate form for b = —2. A result in [10] is shown 
to follow easily from an interesting identity. In sect. 4 we apply these ideas to finding 
closed forms for sums of powers of integers. More specifically, we show how one can find 
a closed form that contains sums of binomial coefficients with integer coefficients for the 



following sums, ^(n) = /^i 2 and S^n) = > ^i 3 . 



i=i i=i 

2. Recurrence Relation and Identities 

In the present section we extend some results in [3]. 

Theorem 2.1. Define real numbers ak >n ,b,c such that k,n are nonnegative integers sub- 
ject to initial conditions 

(2.1) a ,o = 1/c, a ,i = 1/c + b/c 2 . 
and oo,j /or j > 2 by, 

(2.2) -o , n - &a ,„+i + ca 0in+2 = 1, n > 0, 
and 

n 

(2.3) Qfc+l,n = / Qfc.i- 

i=0 

T/ien we /iave 

/ x , /n + A; + 2\ , 

(2.4) -a fe , n - ba k>n+ i + ca fc , n+2 = , k, n > 0, 

V n + 2 J 

such that RHS of (2.4) comprise binomial coefficients in n, k relate the levels of the 
recurrence in k. 
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Proof. We prove inductively on k. Substituting (2.3) into LHS (2.4) with k — 1 and 
employing initial conditions yields, 

— Ol,n — 0Ol,n+l + Ca\ t n + 2 

n n+1 n+2 

= - y^ qq,2^ - & y^ Qo,i + c y^ a ^ 

i=0 i=0 i=0 

= — ao,o — &Oo,i + cao,2 — Oo,i — &a ,2 + ca ,3 — • • • 

— 0-0,11 — &O0,n+l + Cao,n+2 _ 0^0,0 + COQfi + COo,l 

, 1 1 /l 6\ /n + 3 

c c \c c z J \n + 2 

Thus, (2.4) is shown for k — 1. Next we note that, 
Lemma 2.1. 

(2.5) a fc , = -, 

c 

A; + 1 6 

(2.6) a M = + ^, fc > 0. 

Proof. Both results follow form the initial conditions and by repeatedly applying (2.3), 
the former with n = and the latter with n — 1. □ 

We get, by applying the previous lemma and (2.3) to (2.4) with fc = j + 1, the following 

— 0>j+l,n ~~ O a j+l,n+l + COj + 1^+2 

n n+1 n+2 

i=0 i=0 i=0 

= — (2^ q — OQjj \ ~r Ctt^ 2 — O7 l — 0(2^' 2 t Cd^ 3 — ... 

- a j:Tl - ba jjn+1 + ca j:n+2 - ba j>0 + ca j:0 + ca jtl 

'n + j + 3 N 

n + 2 

by a standard (ice or field hockey stick) result in binomial coefficients. □ 

Remark. The use of (2.3) to derive (2.4) is not obvious, but found by trial and error, 
nevertheless, it is somewhat close to the concept of superposition of families of solutions 
in linear differential equations. 

It is also advantageous to write (2.4) in odd and even cases 
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Corollary 2.1. 
,_ /2n + k + 2 

(.2.7J —0-k,2n — POfc,2n+l + COjfc^n+2 — I 9,9 

/2n + fc + 3 

(2-8) — «fc,2n+l ~~ bak t 2n+2 + COfe^n+3 — I 9 1 q 

The following solutions [7] to (2.7,2.8) are given by 

'l+&\n+l\ 1 n n+l+k 



( 2 -9) «o, 2n+1 = -i ^ + ^E £ { i -, 

\ c / fc=0 i=2fc+l v 7 

( 2 -!0) «o, 2n = -1 m l+^E E i jrf 



fc=0 i=2A:+2 



Subtracting (2.9,2.10) gives 
Corollary 2.2. 



n + 1 + k\ b 2k+1 



c k 



1 v^ (n + 1 + k\ 
( 2 - n ) ao,2n+i=a ,2n + -^2^( 2A; + 1 J 

fc=0 \ """ / 

Moreover, let b = 1 — c in 2.11 and employing a result in [7] gives 
Corollary 2.3. 

+ 1 + k\ (1 - c) 2fe+1 l/c 2 (™ +1 ) - 1 1 l/c 2 (" +1 ) - 1 



(2.12) _^y(n + l + k\ 

y ) c n+2£^\ 2k + 1 J 

I — n \ / 



,_..._, c k 1 + c c 1 + 1/c ' 

k=0 

which LHS can be computed by Maple software and RHS is related to partial sum of 
geometric series. Setting c = —1 in cor. 2.3 and employing L'Hopital's rule produces a 
result in [10] given by 

D-^( n 2 £ + i 1 )=» +1 - 

fe=0 v 7 

The derivation in [10] involved rational expressions and equating coefficients of infinite 
series. A few elegant results follow from 2.12. By setting c = 1/p, p > 1 

and 



p 2 ^— ' \ p J \ 2k + 1 7 \p r 

^ fc=0 \ ^ / \ ' / v j 

and we see 



2(n+l) 
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Corollary 2.4. If p is positive integer > 1 and n > 1 then 

(p-l) 2 \ k (n + k + V 
2k + 1 



E 

k=0 



P 



is a positive, non-integer rational number. 

Proof. We have 

(p-l) 2 \ k fn + k + l\ I 
2k + 1 



n 

E 

fc=0 



(p 2n + F 



2(n-l) 



+ P 2 + 1), 



p J \ 2k + 1 J p'- 
and the result follows by considering the remainder mod p. 

One can also easily use Maple software to help compute the following: if p ^ 0, 1 
Corollary 2.5. 

1 



□ 



E 

and also 
Corollary 2.6. 

E 

fc=0 



(P-1) 
p 



n + k + 1 
2k 



n+l 



P 



P 



2n+3 



+ 1 



p 



(P " l) 2 ^^ , 



(P~1 ) S 
P 



2\ k 



n + k + 2 
2k + 1 



n+l 



7^ 



1 — p 



2n+4 



1 — p 2 



_(p_l)2(n+l) 



It is noted that the possible elegance results from exponent 2 in the (p — l) 2 term in 
the sums. 

3. Singular solution: k=l 

In the this section we consider (2.4) with k — 1 so that, 

(3.1) -Oi, n - 6oi, n+ i + ca lyH+2 = n + 3, n > 0, 

It is well-known that the standard solution for n, both odd and even, has the following 
(real) form, 

(3.2) oi,„ = A + Bn + Cn 2 + Cl a n + c 2 /3 n , 

for some undetermined coefficients A, B, C, roots a, (3 of the characteristic equation, and 
some constants Ci, c 2 , that depend on the initial conditions. It is noted in (2.9, 2.10), the 
case c = b + 1 leads to an indeterminate form. The case, c ^ b + 1, is considered in [7]. 
We have 

(3.3) -oi,„ - 6oi, n+ i + (6 + 1) ■ Oi, n+2 = n + 3, n > 0, 

with initial conditions (2.1). The basic elementary idea is to exploit (2.4) with the 
solution (2.9, 2.10), and compare to the newly found solutions to (3.3) for odd and even 
cases. We solve (3.3) for solution a := ai >n according to a = a c + a p such that, 

— a c — ba c + (b + 1) • a c — 0, 

— a p — ba p + {b + l)a p = n + 3. 

We have the following, 
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Lemma 3.1. 

(-l) n 
(3.4) a := a 1>n = d + c 2 , ,, + Bn + Cn 2 , 

(1 + b) n 

where, 

(3-5) B^J^l , C X 



2(6 + 2) 2 ' 2(6 + 2) ' 

(3 6) 6^+56 + 7 1 



(6 + 2)3 > ' (6+1). (6 + 2)3 • 
Proof. For a c : The characteristic equation is given by: 

(6 + 1) • a 2 - 6 • a - 1 = ^=> a = 1, 



1 + 6 

We next compute 5, C by substituting a p = Bn + Cn 2 into (3.3) which yields the 
following equation, 

- C-n 2 -B-n-b-(C-{n + l) 2 + B-(n + l)) 

- b-(C -(n + 2) 2 + B-(n + 2)) = n + 3, 
which, after simplification gives 

(3.7) n ■ C ■ (26 + 4) + C • (36 + 4) + B ■ (6 + 2) = n + 3. 
We obtain from (3.7), the pair of equations: 

(3.8) C • (26 + 4) = 1, C ■ (36 + 4) + B ■ (6 + 2) = 3. 

Solving (3.8) for 5, C yields (3.5). To solve for c±, c 2 in (3.4) use (3.5) with n — 0, 1 and 
(2.6) with /c = l,c = 6+l,to get the pair of equations 

1 

Cl +c 2 



6 + 1 



c 2 1 36 + 8 36 + 2 

ci - 7— r + 



6+1 2(6 + 2) 2(6 + 2) 2 (6+1) 2 ' 
with solution (3.6). □ 

A result in [7], 
Corollary 3.1. 
,~ Q , fr + 3 1 , 2n + l 

1 J a °' 2n+1 (6 + 2)2 (l +fo )2(n + l) (6 + 2 )2 + fe + 2 

ra n+l+fe 



n + 1+ (TT6p2-L L ^ < J 



, (1 + 6)* 

fc=0 i=2fc+l v / \ > 



, Q1fl > 6 + 3 1 2n 

(3.1U) a ,2n - 77 ; oW + 71 , nofa+iw,, , M> + 



(6 + 2)2 (i + 6)2(n+i)(6 + 2) 2 '6 + 2 
n + 1+ (TT6p2-L 1^\ % ) 



, (1 + 6)* 

fc=0 i=2fc+2 x / \ > 
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We have analogous result for cor. 3.1. From 2.3 we have 

2n 2n+l 

(3.11) 0,k+l,2n = / ;Clk,i, dk+l,2n+l = £_^ M- 

i=0 i=0 

We employ (3.11) in RHS of (3.9, 3.10) and use (3.4-3.6) to get after simplification, 

Corollary 3.2. 

6 2 + 56 + 7 1 (2n + l) 2 (2n + l)(36 + 8) 

ai,2n+i - ( & + 2 )3 ~ (l + 6) 2 (™+ 1 )(6 + 2) 3 + 2(6 + 2) + 2(6 + 2) 2 

(312) (n + l)(n + 2) | V-g^ fc ft + l + *^ 6* 



6+1 ' ^ ^ ^ V j / (1 + 5)i+2+fc 

i=l fc=0 j=2fc+2 v J ' v y 

+ EE E 



j j(i + by+ 2 + k ' 

i=0 fc=0 j=2fc+l x J y v y 



6 2 + 56 + 7 1 2n 2 n(36 + 8 



(6 + 2) 3 (l + 6) 2m + 1 (6 + 2) 3 6 + 2 (6 + 2) 



2 



n i—1 i+l+k 



i—i z- — n .;— oi._i_o \ J / 



l + b) i+2+k 
j=l fc=0 j=2fc+2 v 



ra-1 j i+l+fc / . _. , N , , 

^— > ^-^ ^— > /?+l+/e\ b> 



j n + b) i+2 + k ' 

One can then apply the similar procedure for k = 1,2,... thereby producing the 
claimed sequence of identities. The combinatorial aspect of these identities is illustrated 
in the next section. 



4. S^n), S*3(n) and binomial coefficients 

The interesting case in (3.12, 3.13) is 6 — > —2. By continuity of RHS the limit exists 
and we note that binomial coefficients satisfy the following well-known equation 



(4 1) ( 2n + 2 + k \ _ 2 (2n + 3 + k\ /2n + 4 + A;\ (2n + 2 + k 
{L ' V 2n ) \2n + l) + \2n + 2)~\2n + 2 

(A 2) ( 2n + 3 + k\_j2n + 4 + k\ (2n + 5 + k\ _ (2n + 3 + k 
By applying (4.1, 4.2) to (3.12, 3.13) we get that 
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Corollary 4.1. 

6 2 + 56 + 7 1 (2n + l) 2 (2n + l)(36 + 8) 

im ^~ 2 (6 + 2) 3 " (l + 6) 2 (™+ 1 )(6 + 2) 3 + 2(6 + 2) + 2(6 + 2) 2 
2n + 4 N 
2n + l, 

6 2 + 56 + 7 1 2n 2 n(36 + 8 



(6 + 2) 3 (l + 6) 2 ™+ 1 (6 + 2)3 6 + 2 (6 + 2) 2 
2n + 3' 
2n 



Applying cor. 4.1 to (3.12, 3.13) we obtain 
Corollary 4.2. 



n i—1 i+l+k 



;;; ) = {n+1){n+2)+ ^^ E (•+ +*y ( _ 1)1+ « + * 

' i=l fc=0 j=2fc+2 ^ J ' 

+ EE E ( + )2'(-i) 1+ - + ' +l , 

j=0 fc=0 jr=2fe+l \ ■> ' 

(44) ( 2 " 2 D = (" +i ) 2+ ee'e C +1 , + *) 2 ' ( - i)1+,+ '" 

^ ' i=l fc=0 j=2fc+2 \ J J 

n—1 i i+l+k ,. 1 , -. 

+ EE E f + . + *V(-i) i+ «+'. 

j=0 fc=0 jr=2fe+l V J / 

We next show how it is possible to express sums of powers of integers of the form YJ i p , 
where p = 2, 3 as a sum of binomial coefficients. To see how this is done we have from 

[7] 



n— 1 n+l+k , 1 , 

(4.5) «(«+i) = (-ir +1 E E ( F" 1 ^ 1 



j— 1 / -i \i+/c 

i ,■ r 

k=0 i=2k+2 



n n+l+k 

I i i I ! / ■ \ 

-1/ i\i+fc 



'» n -t-ij- K • _i_ 1 i lA 

(4.6) (n + l) 2 = (-l) n+1 ^ J2 { J H-ir.n^O. 

fc=0 j=2fc+l ^ ' ' 

Employing (4.5, 4.6) in (4.3, 4.4) gives 

(4.7) ( "+) = (n+l)(n + 2) + 2-^.-(. + l) + 2-^(z + l) 2 , 

^ ' i=l i=0 

(4.8) ( 2n 2 + 3 ) = (n+l) 2 + 2-X:-^ + l) + 2-X:^ + l) 2 . 



i=l i=0 
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Simplification of (4.7,4.8) leads to, 



(4.9) J:}) - 5> + * 

\ ' / j=0 

(«o) ( 2 ;: 3 ) = E(* + D'- 

v 7 i=0 



Summing (4.9, 4.10) yields, 



2n + 4\ (In + 3\ 2 ^ 2 , 2 

l) + ( 2n ) = £'• 



2n + 

2n+l 

2n - 1 ) V 2n 



We also note the well-known formula 



n _. • 

(4.11) ^. 2 = ^-(n + l)-(2n + l) = - 



n , _ x _ ., l/2n + 2 

4 V 2n - 1 



which is implied by (4.9). 

Moreover, these ideas can be extended by applying (3.11,4.1,4.2) to corollary 3.2. We 
obtain, setting b = — 2 that 



2n + 4 
2n 



n—\ 

-&2,2n = — i 2_^ ai > 2i ~^~ Z_^ a l,2i+l 
, i=0 i=0 

l-l l+l+k 



(4.12) 



Ef'+^-EEt-^E E ( +fc )(-i)^- 1 

i=0 i=l J=l fc=0 j=2fc+2 \ •> S 

n i—1 I l+l+k ,-. i , 7 \ 

EE-D'^E E ( . + *)(-D' + '2'- 



i=l Z=0 fc=0 j=2k+l 

n-1 n-1 i l-l l+l+k 



+ E(* +!)(>+ 2 ) -EE(-d'- 2 E E ( )'-i"-2--' 

i=0 i=l !=1 fc=0 jr=2fc+2 ^ J 

n-1 i I l+l+k 



EEf-D'^E E ( ■ jt-D^" 1 . 



i=0 Z=0 fc=0 j=2fc+l 
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/2n + 5\ AA A 

\2n+l ~ ~ a ^ 2n+1 ~ ~ I 2^ ai < 2i + Z^ a L2i+i 



,i=0 i=0 

n n i I— 1 Z+l+fc // , -i _i_ K\ 

(4.i3) = B^ 1 ) 2 -EB- 1 )' - 2 £ E ( , jc-iy^-- 1 

i=0 i=l Z=l fc=0 j=2k+2 \ •> / 

n i—1 I l+l+k // , -i , L.\ 

- ££(-i)'-2£ £ ( , + 1 + k )(-i)^2>-> 

i=l 1=0 fc=0 j=2k+l V J / 

n n i l—l l+l+k ,, _. , -. 

+ J> + !)(; + 2) -££(-!)<• 2 £ ^ (' + +fc )(-l)^-' 

i=0 i=l i=l fc=0 jr=2fe+2 \ J J 

n i I l+l+k /; , -I _i_ K\ 

- ££(- 1 >'' 2 £ £ ( . + *)(-D^-'. 

i=0 «=0 fc=0 j=2k+l \ J / 



We have by summation, from (4.5,4.6), 



i i l—l l+l+k ,, 1 | ? \ 

(4.i4) E / -( / + 1 ) = E(- 1 ) m -E E ( 7 )2- l (-i)^, 

Z=l J=l fc=0 j=2fc+2 \ J J 



i i I l +l+ k ,, 1 , . \ 

(4.i5) E^ + ^Et-^E E ( + ? + )2- 1 (-i)^. 

i=l J=l fc=0 j=2fc+l V J / 

Employing (4.14, 4.15) in (4.12, 4.13) we find that, 

2n ) = ££2-M< + i) + ££2-« + i) 2 + £(> + i) 2 

' i=l «=1 i=l 1=0 i=0 

n— 1 i n— 1 i n— 1 

( 4 -!6) + ^^2-/-(/ + l) + ^^2-(/ + l) 2 + ^(. + l)-(z + 2), 

i=i i=i i=o ;=o i=o 

(„ _\ n i n i—1 n 

'^ ) = ££2-M< + D + ££2-(' + i) 2 + £(* + i) 2 
' i=i i=i j=i ;=o «=o 

n i n i n 

( 4 - 17 ) + EE 2 - / -( / + 1 ) + EE 2 -( / + 1 ) 2 + E^ + 1 )-^+ 2 )- 



j=i ;=i i=o ;=o 
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We consider (4.16,4.17) and find that 

n— 1 i n i— 1 



2n + 4 
2n 



(4.18) 



j=l Z=l i=l (=0 

n ra— 1 

+ ]>> + l) 2 + 35> + !)(« + 2 )' 



i=0 



i=0 



2n + 5 



n i— 1 



(4.19) 



= 4.£2>(i + i) + 4.££.(j + i)' 

i=l Z=l i=l 2=0 

n ra 

+ 3 .£(z + l) 2 + J> + l)(i + 2). 



i=0 i=0 

We simplify (4.18) employing (4.11) to get 

ra— 1 i ra— 1 i— 1 



(~ .\ ra— ± / xt— i i— i ra 

/ i— i ;— 1 i— 1 ;— i ;— i 



i=l (=1 



i=l J=l Z=l 



£ 



2.| a + 2 ) + 4.(' + 1 
2i-l) V 2 



(4.20) 



i=l 

1 ^ / 1/ + I 

i=0 



n + 2 
2 



We find, after straightforward simplification of (4.20) that, 



(4.21) ]T(2*) 3 = 3 



i=l 



2n + 4\ 9 (In + 2 



13 



71+1 



71+1 

1 



Using the fact that 



2n + 4 
2n 



2n + 4 
2n+l 



2n + 5 
2n + l 



and (4.9) we find that 

n 

£ 



i=\ 

so that 



\ (8 • i 3 + 12 • i 2 + 6 • i + 1) + 6 • i 2 + — • % + ^- 



2n + 5 
2n + l 



£, + ir- a. (--) _|. (--)- 3i. cr)--cr) 



(4.22) 
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Summing (4.21,4.22) gives 



2n+l 

E 



i'2n + 5\ „ /2n + 4\ „ (In + 2\ _ /n + l\ „„ /n + 1 



3- 



3- 



9- 



44. 



17- 



1 



= 3 -( to 4 +5 ) +3 -( fc 4 +4 )- 9 -( 2n 3 +2 )- 44 -C , 2 1 

(4.23) 

We next use the fact that 
(4.24) (2n + l) 3 = 3 



17-.- -it."- 1 



2n + 2\ /2n + 3 
3 ) " V 3 



3( '2(n + l)\ /2n + l 



Thus, for evenly many terms from (4.23, 4.24) 

In 



v^ % (In + 5\ /: 



2n + 4\ n /2n + 2\ ^ /n + 1 
4 "M 3 - 44 ' 2 



17- 



n + 1 



-3 



2n + 2\ /2ra + 3 
3 + 3 



2(n + T 



3.M +3 f" 4 + 4 )-3.( 2 " 3 + 3 



/n + 1\ / n \ (n — 1 

" •' 2 ~ 13 ' 1 - 15 ' 



2n + l 
1 



1 2,'- 3 -) +3 (- 1 



(4.25) 
Finally, we present, without proof, a variation of a similar result in [9]: 



3=1 

1 /p + 1 



/; ) = / x p dx, 
o 



p + 1 u + 1 



•1V +1 . 
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Abstract : In this paper we observe reflection symmetries of the g-extension of Genocchi polynomi- 
als, c„ ]( j(x), using numerical investigation. By numerical experiments, we demonstrate a remarkably 
regular structure of the complex roots of the c n ^ q (x) for q < 0. Finally, we give a table for the 
solutions of the g-extension of Genocchi polynomials. 

Key words : g-extension of Genocchi numbers, g-extension of Genocchi polynomials, Roots of q- 
extension of Genocchi polynomials, Reflection symmetries of the g-extension of Genocchi polynomials 
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1. Introduction 

In [2], T. Kim constructed the g-extension of Genocchi numbers c n , q and polynomials c n ,q(x) using 
generating functions. In order to study the g-extension of Genocchi polynomials c n ,q(x), we must 
understand the structure of the g-extension of Genocchi polynomials c nq (x). Therefore, using 
computer, a realistic study for the g-extension of Genocchi polynomials c n _ q (x) is very interesting. 
For related topics the interested reader is referred to [3, 4, 5, 6]. The main purpose of this paper is 
to consider reflection symmetries of the g-extension of Genocchi polynomials, c n>9 (a;) for values of 
the index n by using computer. First, we introduce the g-extension of Genocchi polynomials c n>q {x) 
(see [2, 3]). Let g be a complex number with |g| < 1. We consider the following generating functions: 

t n _t_ ~ (2n + l)(l-gf n ) / 1 Y' 1 . -*„_!*" 



71=0 

and 



w-E^-^E^^i (-ir-'s- "» 



n=0 n=0 H \ y/ 



By simple calculation in (2), we have 



ra-l 



^/n\(2* + l)(l-««) / i \ , .w-i lx 



^ x)= h^) i-W {—,) { - i) qlx - 

When x = 0, we write c n ,q — c n , g (0), which are called the g-extension of Genocchi numbers. c n>q (x) 
is a polynomial of degree = n in q x . Since 

£ G„(l - x) <=^ = F(l - x, -t) = -Z* e V-)l-*) 

n—0 



n 



e xt = -F(x,t) = -Vt7„(x)-, 

z — • nl 



n=0 
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we obtain that 



G„(x) = (-l) n+1 G n (l-x). 



(3) 



We prove that G n (x),x £ C, has Re(x) = \ reflection symmetry in addition to the usual Im(x) = 
reflection symmetry analytic complex functions. The question is: what happens with the reflection 
symmetry (3), when one considers the q-extension of Genocchi polynomials? We are going now to 
reflection at | of x on the ^-extension of Genocchi polynomials. For n > 0, we have 



C n,q\ X l 



n >q 



l (l-x) = (-l) n - 1 q n c^ q (x). 



(4) 



(4) is the g-analog of the classical reflection formula (3). c* nq (x)(q > 0) has Im(x) — reflection 
symmetry analytic complex functions (Figure 3). c* q {x) has not Re(x) — 1/3 reflection symmetry ( 
Figure 3). If c* n Jx) = 0(q > 0), then cj^l -x) = c* n>q {x*) 
complex conjugation. 



c* i(l — x*) = 0, where * denotes 



2. Zeros of the c* q (a;) 

In order to study c* „ (x) , we must understand the structure of the g-extension of Genocchi 
polynomials. In this section, by numerical investigation, we examine properties of the figures, look 
for patterns, and make open problems. First, we display the shapes of the c* Ax) and we investigate 
the zeros of the c* q {x) . For n = 1, • • • , 10, we can draw a plot of the c* q (x), respectively. This shows 
the ten plots combined into one. We display the shape of c n ^ q (x), c* q (x), n = 1, • • ■ , 10, —1 < x < 1. 



C L,(- V ) 




C ,*.«W 




Figure 1: Curvers of d^ q (x),q = 1/3 Figure 2: Curvers of c* nq {x),q = 1/2 

We investigate the beautiful zeros of the c* (x) by using a computer. We plot the zeros of 
the c* q (a;) for n = 30, 40, 50, 60, q = 1/3 and x € C. (Figure 3). We plot the zeros of the c* g (a;) 
for n = 30,40,50,60,g = —1/3 and x £ C. (Figure 4). We plot the zeros of the c n _ q (x) for 
n= 30, 40,50,60,g= -1/3 and x £ C. (Figure 5). 

We observe a remarkably regular structure of the complex roots of the c* q {x). We hope to 
verify a remarkably regular structure of the complex roots of the c* Ax) (Table 1) . Next , we calculate 
an approximate solution satisfying c* q (x),x £ R. The results are given in Table 2. 
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Im(x) - 



{ 



Re(x) 




Im{x) 




1.5 

1 

0.5 

lm(x) 

-0.5 
-1 

-1.5 




Re(x) 



Figure 3: Zeros of c* (x) for n = 10, 20, 30, 40, q = 1/3 



Table 1. Numbers of real and complex zeros of c* q (x) 



degree n 


q 


1 

~ 3 


o=\ 


real zeros 


complex zeros 


real zeros 


complex zeros 


2 





1 


1 





4 





3 


1 


2 


6 





5 


1 


4 


8 





7 


3 


4 


10 





9 


3 


6 


12 





11 


3 


8 


14 





13 


3 


10 
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Im(x) - 



f^- 






-0.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



Im(x) 



P\ 



.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



Im{x) - 



P\ 



Im(x) 



-0.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



P\ 



-0.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



Figure 4: Zeros of c* Jx) for n = 10, 20, 30, 40, q = -1/3 



Table 2. Approximate solutions of c* Jx) = 0, q = 1/3, x £ 



degree n 


a; 


2 


0.0653041 


3 


-0.19615, 0.268175 


4 


0.452437 


5 


-0.248071, 0.609023 


6 


0.743617 


7 


-0.144954, 0.861249 


8 


-0.35968, -0.0411608, 0.965587 


9 


0.0527129, 1.05928 


10 


-0.397998, 0.137934, 1.14427 
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Im(x) 



k^ 



lm(x) 



-0.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



^ 



.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



Im{x) 



^ 



tiu(x) 



-0.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



m 9 



-0.75-0.5-0.25 0.25 0.5 0.75 1 

Re(x) 



Figure 5: Zeros of c„, q (x) for n = 10, 20, 30, 40, q = -1/3 



Table 3. Approximate solutions of c n _ q (x) = 0, q= 1/3, x € 



degree n 


X 


2 


0.0653041 


3 


-0.19615, 0.268175 


4 


0.452437 


5 


-0.248071, 0.609023 


6 


0.743617 


7 


-0.144954, 0.861249 


8 


-0.35968, -0.0411608, 0.965587 


9 


0.0527129, 1.05928 


10 


-0.397998, 0.137934, 1.14427 



We calculated an approximate solution satisfying c n . q (x), c* {x),q = —1/3, a; G M. The results are 
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^ivcn in Table 4 and Table 5. 

Table 4. Approximate solutions of c* „(x), g = -l/3,ieC 



degree n 


X 


2 


-0.055099 + 0.1575611 


4 


-0.263443 - 0.136379i, -0.082275 + 0.235274i, 
0.291022 + 0.0575166i 


6 


-0.279677 + 0.0472548i, -0.257121 - 0.289292i, 

-0.0911906 + 0.260769i, 0.189278 + 0.211248i, 

0.381371 -0.0660123i 


8 


-0.309726 - 0.083174H, -0.260268 + 0.127805i, 

-0.241352 - 0.386845i - 0.095592 + 0.273355i, 

0.123901 + 0.262149i, 0.294061 + 0.12797i, 

0.429832 - 0.152132* 



Table 5. Approximate solutions of c n ^ q {x),q = —1/3, a; G 



degree n 


X 


2 


-0.055099 - 0.157561? 


4 


-0.263443 + 0.136379i, -0.082275 - 0.235274i, 
0.291022 - 0.0575166i 


6 


-0.279677 - 0.0472548i, -0.257121 + 0.289292i, 

-0.0911906 - 0.260769i, 0.189278 - 0.211248?, 

0.381371 + 0.0660123 


8 


-0.309726 + 0.0831741?, -0.260268 - 0.127805i, 

-0.241352 + 0.386845i - 0.095592 - 0.273355i, 

0.123901 - 0.262149i, 0.294061 - 0.12797i, 

0.429832 + 0.152132i 



3. Directions for Further Research 

Finally, we shall consider the more general problems. Prove or disprove: c* (x) = has 
n — 1 distinct solutions. Find the numbers of complex zeros C c * ( x ) of c* „{x), Im(x) ^ 0. Prove or 
disprove: Since n — 1 is the degree of the polynomial c* n „(x), the number of real zeros R c * i x \ lying 
on the real plane Im(x) = is then R c * i x \ = n— 1 — C c » i x \, where C c » i x \ denotes complex zeros. 
See Table 1 for tabulated values of R c * ^ and C c » r x y The open question is: what happens with 
the reflection symmetry (4), when one considers the c* „{x) for q < ? (Sec Figures 4, 5). Find the 
equation of envelope curves bounding the real zeros lying on the plane. The author has no doubt 
that investigation along this line will lead to a new approach employing numerical method in the 
field of research of the c* Jx) to appear in mathematics and physics. The reader may refer to [3, 4, 
5, 6] for the details 
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On Best Simultaneous Approximation in Semi 

Metric Spaces 

H. K. Pathak 1 and Satyaj Tiwari 2 



Abstract 

In this paper, the existence of invariant best simultaneous approx- 
imation in semi metric space is proved. In doing so, we have used a 
recent result of Moutawakil regarding the fixed points for set-valued 
mappings. 

1. Introduction 

In the realm of best approximation theory, it is vaible, meaningful and po- 
tentially productive to know whether some useful properties of the function 
being approximated is inherited by the approximating function. In this per- 
spective, Meinardus [8] observed the general principle that could be applied, 
while doing so the author has employed a fixed point theorem as a tool to 
establish it. The result of Meinardus was further generalized by Habiniak 
[5], Smoluk [15] and Subrahmanyam [16]. 

On the other hand, Beg and Sahazad [2], Fan [4], Hicks and Humphries 
[6], Reich [10], Singh [13], [14] and many others have used fixed point the- 
orems in approximation theory, to prove existence of best approximation. 
Various types of applications of fixed point theorems may be seen in Klee [7] , 
Meinardus [8] and Vlasov [18] . Some applications of the fixed point theorems 
to best simultaneous approximation is given by Sahney and Singh [11]. For 
the detail survey of the subject we refer the reader to Cheney [3]. 

In this paper, we prove the existence of invariant best simultaneous ap- 
proximation in semi metric space, while doing so, we use the recent result of 
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Moutawakil [19] on the fixed points for set-valued mappings. 



2. Preliminaries and Definitions 

Let (X, d) be a metric space. Let(CB(X),H) denote the hyperspace of 
nonempty closed bounded subsets of X, where H is the Housdroff metric 
induced by d, that is, 

H(A, B) = maxjsup d(a, B), sup d(b, A)} 

a&A b&B 



for all A,Be CB(X), where d(x, A) = mi yGA {d(x,y) : x e X} and Ad X. 

Although the fixed point theory for single valued maps is very rich and well 
developed, the multivalued case is not. Note that the multivalued mappings 
play a major role in many areas as in studying disjunctive logic programs. 
On the other hand; it has been observed that (see example [22]) that the 
distance function used in certain metric theorems proofs need not satisfy the 
triangular inequality nor d(x, x) = for all x. Motivated by this fact, Hicks 
and Rhoades [22] established some common fixed point theorems in sym- 
metric spaces and proved that very general probabilistic structures admits a 
compatible symmetric or semi-metric. Recall that a symmetric on a set X 
is a non negative real valued function d on X x X such that (i)d(x, y) — if 
and only if x — y, (ii) d(x, y) = d(y, x). 

In order to unify the notation, we need the following notation: 
(WA) Given {x n },{y n } and x in X, lim d(x n ,x) = and lim d(x n ,y n ) = 

imply that lim d(y n ,x) = 0. 

A sequence in X is called a d— cauchy sequence if it satisfies the usual metric 
condition. X is S— complete if for every d— cauchy sequence (x n ), there exists 
x in X with lim d(x n ,x) = 

The HausdorfF distance in a symmetric space. 

Definition. Let (X, d) be a symmetric space and A a nonempty subset of X. 

(i) We say that A is d — closed iff A d = A where 
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A d = {x eX : D(x, A) = 0} and d(x, A) = inf{d(x, y) : y G A} 

(ii) We say that A is d-bounded iff Sd(A) < oo where 5d{A) = sup{d(x,y) : 
x,y G A} 

The following definition is a generalization of the well-known Hausdorff dis- 
tance to the setting of symmetric case. 

Definition. Let (X, d) be a d— bounded symmetric space and let C(X) be 
the set of all nonempty d— closed subset of (X,d). Consider the function 
H : 2 X x 2 X -> R defined by 



H(A, B) = maxjsup d(a, B), sup d(b, A)} 

aeA b£B 

for all A,BeC(X). 

Remark. It is easy to see that (C(X),D) is a symmetric space. 

Now we give the notion of convex structure introduced by Gudder [20] (see 
also, Petrusel [21]). 

Let X be a set and F : [0, 1] x X x X — > X a mapping. Then the pair 
(X, F) forms a convex ^restructure. Let (X, F) be a convex prestructure. If 
F satisfies the following conditions: 

(i) F(X,x,F(^y,z)) = F(X+(l-X) l 2, F(X(X + (1 - A)//)" 1 , x, y),z) for 
every A, // 6 (0, 1) with A + (1 — X)fi ^ and x,y,z G X. 
(ii) F(X, x,x) = x for any x G X and A G (0, 1), 

then (X, F) forms a semi-convex structure. If (X, F) is a semi-convex struc- 
ture, then 

(SCI) F(l, x, y) — x for any x, y G X. 

A semi-convex structure is said to be regular if 

(SC2) A < /i =>• F(A, x, j/) < F(/i, x, y) where A, fi G (0, 1). 

A semi-convex structure (X, F) is said to form a convex structure if F 
also satisfies the conditions 
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(Hi) 
(iv) 



F(X, x, y) = F(l — A, y, x) for every A G (0, 1) and x, y G X. 
if F(X, x, y) = F(X, x, z) for some A^l,xGl then y — z. 



Let (X, F) be a convex structure. A subset Y of X is called (a) F- 
starshaped if there exist p G Y so that for any x G F and A G (0, 1), F(X, x,p) G 
Y. (b) F-convex if for any x, y in Y and A G (0, 1), F(X, x,y) G Y. For 
F(X,x,y) = Xx + (1 — A)y, we obtain the known notion of starshaped con- 
vexity from linear spaces. Petrusel [20] noted with an example that a set can 
be a F-semi convex structure without being a convex structure. Let (X, F) 
be a semi-convex structure. A subset Y of X is called F semi- starshaped if 
there exists p G Y so that for any x G F and A G (0,1), F(X,x,p) G F. 
A Banach space X with semi-convex structure F is said to satisfy condition 
(Pi) at p G K (where K is semi-starshaped and p is star centre) if F is 
continuous relative to the following argument : for any x, y G X, X G (o, 1) 

II (F(A,x,p) -F(X,y,p) < X \\ x-y \\ . 



To prove our main result (Theorem 3.1 below), we shall make use of the fol- 
lowing result due to Moutawakil ([19], theorem 2.2.1). 

Theorem A. Let (X, d) be a d— bounded and S— complete symmetric space 
satisfying (W — 4) and let T : X — > C(X) be a set-valued mapping such that 

H(Tx,Ty) < kd(x,y) 
for all x, y G X, where k G [0, 1). Then there exist u E X such that u G Tw. 

Let (X, d) be a metric space and G a nonempty subset of X. Suppose 
A G C(X), the set of nonempty d— closed subsets of (X, d),then we write 



r G (A) = inf geG sup aeA d(a,g) 

cent G (A) = {g eG : sup aeA d(a, g ) = r G (A)}. 

The number r G (A) is called the Chebyshev radius of A w.r.t G and an el- 
ement 2/0 G cent G (A) is called a best simultaneous approximation of A w.r.t 
G. If A = {x}, then r G (A) = d(x,G) and centc(^) is the set of all best 
approximations of x out of G. We also refer the reader to Milman [9] for 
further details. 
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3. Main Results 

Now we state and prove our main result. 

Theorem 3.1 Let (X, d) be a d-bounded and S-complete semi-metric space 

with semi-convex structure satisfying condition (Pi) and T : X — > C(X) 

be a set- valued mapping. Let G G C(X). For A C C(X), if centc(A) is 

nonempty, compact, semi-starshaped, T-invariant and T satisfy the following 

conditions: 

(i) T is continuous on cento {A), and 

(ii) H(Tx,Ty)<kd(x,y) 

for all x,y G centc(A) with x 7^ y,d(x,y) > 0, then centc(A) contains a 
T-invariant point. 

Proof. Let p be the starcentre of centc{A). Then F(x,p, A) G centc(A) for 
each x G centc(A). Let {A:„}^Li be a real sequence with < k n < 1 such 
that fc n — > 1 as n — > 00. Define T„ : centc(^4) — > C(centc(yl)) by 

T n x = F(k n ,Tx,p) = (J F(k n ,y,p) 

yGTx 

for all x G centc(A). Since p is semi-starcenter of cento (A) and T (cent g{A)) C 
centc(A), It follows that T n maps centc(A) to itself for each n. Now applying 
condition (Pi), we obtain 

H(T n x,T n y) = H(F(k n ,Tx,p),F(k n ,Ty,p)) 

<k n H(Tx,Ty)) 
< k' n d(x,y) 
i.e., 

H(T n x, T n y) < k' n d(x, y) 

for all x,y G cento(A), It follows by Theorem A that each T n has a fixed 
point, say z n . Since centc(A) is complete, {^ n } has a convergent subsequence 
{z ni } such that z ni — *■ 2; (say) as i — > 00 for some z6l Since 
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and k ni — »■ 1 as % — ► oo, it follows that z G Tz. Hence centc{A) contains a 
T— invarient point. This complete the proof. 

Remark 3.2 Let (X, d) be a d-bounded and S-complete semi-metric space 
with semi-convex structure satisfying condition (Pi) and let T be a self map 
on X. Let G £ X. For A C X, if centciA) is nonempty, compact, semi- 
starshaped, T-invariant and T satisfy the following conditions: 

(i) T is continuous on centc{A), and 

(ii) (Tx, Ty) < kd(x, y) 

for all x,y e centc(A) with x 7^ y,d(x,y) > 0, then centc(A) contains a 
T-invariant point. 



References 



[1] Beg, I. and Azam, A., Fixed points of multivalued locally contractive 
mappings, Boll. U.M.I., (7) 4-A (1990),227-233. 

[2] Beg, I. and Shahzad, N., An application of a fixed point theorem to best 
approximation, Approx. Theory and its Appl., 10:3(1994), 1-4. 

[3] Cheney, E. W., Application of fixed point theorems to approximation 
theory, Theory of Approximations, Academic Press (1976), 1-8. 

[4] Fan, Ky., Extension of two fixed point theorems of F.E.Browder, Math 
Z.,112 (1969), 234-240. 

[5] Habiniak, L., Fixed point theorems and invarient approximations, J. Ap- 
proximation Theory, 56(1989), 241-244. 

[6] Hicks, T.L. and Humphries M.D., A note on fixed point theorems, 
J. Approximation Theory, 34 (1982), 221-222. 

[7] Klee, V., Convexity of chebyshev sets, Math. Ann., 142(1961), 292-304. 

[8] Meinardus, C, Invaiauz Bei Lineaeu Approximation, Arch. Rational 
Mech.Anal., 14(1963), 301-303. 

[9] Milman, P.D., On best simultaneous approximation in normed linear 
spaces, J. Approximation Theory, 20(1977), 223-238. 



PATHAK-TIWARI: BEST SIMULTANEOUS APPROXIMATION 629 



[10 
[11 

[12 
[13 
[14 
[15 
[16 

[17; 

[18 
[19 

[20 
[21 
[22 



Reich, S., Approximate selection,best approximations, fixed points and 
invarient sets, J.Math.Anal.Appl., 62(1978), 104-113. 

Sahney, B.N. and Singh S.P., On best simultaneous approximation, Ap- 
proximation Theory III, Academic Press (1980), 783-789. 

Singh, S.P., Application of fixed point theorems in approximation theory, 
Applied Nonlinear Analysis, Academic Press (1979), 389-394. 

— -, Application of a fixed point theorem to approximation theory, J. 



Approx. Theory, 25(1979), 88-89. 

— -, Some results on best approximation in locally convex spaces, J. 
Approx. Theory, 28(1980), 72-76. 

Smoluk, A., Invarient approximations, Mathematyka [Polish], 17(1981), 
17-22. 

Subrahmanyam, P.V., An application of a fixed point theorem to best 
approximations, J. Approx. Theory, 20(1977), 165-172. 

Takahashi, W., A convexity in metric spaces and nonexpancive map- 
pings I, Kodai Math. sem. Rep., 22(1970), 142-149. 

Vlasov, L.P., Chebyshev sets in Banach spaces, Soviet Math. Polody, 
2(1961), 1373-1374. 

Driss El Moutawakil, A fixed point Theorem for multivalued maps in 
symmetric spaces, Applied Mathematics E- Notes, 4(2004), 26-32. 

Gudder, S.P., A general theory of convexity, Rend. Sem.Mat.Milano, 
49,(1979), 89-96. 

Petrusel, A., Starshaped and fixed points, Seminar on fixed point theory 
(Cluj-Napoca), Stud.Univ."Babes-Bolyai", Nr.3,(1987), 19-24. 

T.L. Hicks and B.E. Rhoades, Fixed point theory in symmetric spaces 
with applications to probabilistic spaces, Nonlinear Analysis 36 (1999), 
331-344. 



1 H. K. Pathak 

School of Studies in Mathematics 

Pt. Ravishankar Shukla University, Raipur 

(C.G) 492010, India 



630 PATHAK-TIWARI: BEST SIMULTANEOUS APPROXIMATION 



E-mail : hkpathak@sify.com 

2 Satyaj Tiwari 

Department of Mathematics 

Shri Shankaracharya Institute of Professional Management and Technology, 

P.O. Sejabahar, Raipur 

(C.G) 490021, India 

E-mail : tsatyaj@yahoo.co.in 



JOURNAL OF CONCRETE AND APPLICABLE MATHEMATICS, VOL.8, NO.4, 631-637, 2010, COPYRIGHT 2010 EUDOXUS PRESS, LLC 



ON BEST UNIFORM APPROXIMATION OF PERIODIC 
FUNCTIONS BY TRIGONOMETRIC POLYNOMIALS 

MICHAEL I. GANZBURG 



Abstract. We discuss inequalities of the form C\E n (f') L * < -EVi(/)c* < 
C'2E n (f') L * between the errors of approximation in the uniform and integral 
metrics of periodic functions / and /' by trigonometric polynomials. As ap- 
plications, we obtain upper and lower estimates for E n (f)c* in terms of the 
Fourier coefficients of a function /. 



1. Introduction 

Let C^jr be the Banach space of all 27r-periodic continuous functions / on the 
real line M with the finite norm ||/||c* := m & x a;e[o,27r) |/(x)|; L^ the Banach 
space of all 27r-periodic measurable functions / on [0,27r) with the finite norm 

I l/l l L 2x : ~ /o l/( x ) M x > an d 1~ n the class of all trigonometric polynomials of degree 
< n. For n = 0, 1, . . . , we define the approximation errors by 

E n(f)c L ■= ^ 11/ - T n \\ CL , E n (f)L L ■= T inf^ ||/ - T n \\ Liw . 

Throughout the paper C, C\, . . . denote positive constants independent of n. The 
same symbol does not necessarily denote the same constant in different occurrences. 
In this paper we discuss upper and lower estimates for the error of uniform 
approximation of periodic functions by trigonometric polynomials in terms of their 
Fourier coefficients. The estimates for E n (f)c* are based on inequalities of the 
form 



CMn)£„(/')L. < E n {f) c * < C 2 (n)E n (f) 



L%„ 



where C 2 (n) = 1/2, n > 1 (Theorem 2.1) and Ci(n) = [4(n+l)] l ,n>\ (Theorem 
3.1). Our approach is illustrated by five example. 

2. Upper Estimates 

We first discuss upper estimates for E n (f)c* in terms of the Fourier coefficients 
of an individual function /. Jackson's theorems are inapplicable in this case, while 
various linear approximation methods have proved to be efficient [8], [1] [9], [4]. In 
particular, if the scries 

oo 

f(x) = ao/2 + 2_^ iflk cos kx + bk sin kx) (2-1) 

fe=i 



Key words and phrases. Best approximation, trigonometric polynomials, Fourier coefficients. 
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is absolutely convergent, then the following trivial estimate 

DC 

E n (f)c L < J2 d flfc l + l 5fe D 
k=n+l 

generated by the Fourier approximation can be efficient for rapidly decreasing 
Fourier coefficients [4] as well as for slowly decreasing ones. The latter state- 
ment is supported by the estimate E n (f\)c* < C(n + 1)~ A+1 , where f\(x) :— 
X^fcli k~ x coskx, A > 1. Lower estimate (3.14) given in Section 3 shows that the 
upper estimate cannot be improved. Note that for an integer A > 1 an asymptotic 
formula for E n (f\)c* was found in [2]. 

Here we consider upper estimates for E n (f)c* , which can be applied to con- 
ditionally and absolutely convergent series (2.1). Our approach is based on the 
following simple result for functions of bounded variation. 

Theorem 2.1. Let f G C% n be an absolutely continuous function with f G L\^. 
Then the following inequality holds: 

E n {f) CL <{l/2)E n {f) L ^. (2.2) 

Proof. Let B\(x) := Y^k=i k~ 1 smkx be a Bernoulli function and let Q n G T n be 
a polynomial of best mean approximation to /'. Then the following relation holds 
[5, Eq. 1.5.1]: 

/(*) - a /2 + - [ Bi(x- t)f'{t)dt. 

7T ./n 



In addition, the function 



1 



7T 



2 7 ;- 



T n (x) := a /2 + - / B x {x - t)Q n (t)dt 



belongs to T n . Therefore, 



2tt 



E n (f)c L <\\f-T n \\ c ^ < - sup |Bi(*)| / \f'(t)-Q n (t)\dt 

71 tG[0,27r) JO 

= (1/2)£„(/')l- 
This proves (2.2). □ 



There are several results on finding or estimating E n (g)L* . Three of them are 
given in the following lemma. 

Lemma 2.1. (a) If a sequence {-B/j}^ n+1 of the Fourier coefficients of an odd 
function g{x) — X^fcLi B^smkx G L* 2v is 2-monotone, that is, B^ > 0, B^ — B^ + i < 
0, and Bk — 2Bk+\ + B^+2 > for k > n + 1, then g G L\ v and 

m— 

(b) If a sequence {Ak} k x L n+1 of the Fourier coefficients of an even function g{x) = 
Aq/2 + X^feLi ^fc cos kx is 3-monotone, that is, A^ > 0, A^ — A^+i < 0, A^ — 
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2A k+ i+A k+2 > and A k -3A k+ i+3A k+2 -A k+3 < for k > n+l, then g e L 27r 
and 

oo A 

rn—O 

(c) If g(x) = A /2 + X)feLi + cos kx e L* 27T , then 

E n {g)Li„ < C f \A n+1 \ + \A 2n+2 \ + J^ k\A k+n - 2A k+n+1 + A k+n+2 \ J . 

Statements (a) and (b) of Lemma 2.1 were proved by Nagy [6] (see also [8, 
Sections 2.11.5 and 2.13.32]), while statement (c) was established by the author 
[4, Lemma 3.4]. Note that the condition g <G L* 2t . in Lemma 2.1(a) is equivalent 
to the statement that g is the Fourier scries and, in addition, it is equivalent to 
convergence of the series Yl^kLi-Bk/k. 

Combining Theorem 2.1 and Lemma 2.1, we obtain upper estimates for E n (f)c* ■ 
Let us illustrate this approach by the following examples. 

Example 2.1. f(x) = £~ 2 ^, q > 0. Then the series /'(*) = £~ 2 gjf 
converges at every x € (0, 27r) and /' e L 2v since {log -9 fc}^L 2 ^ s a convex sequence 
[10, Theorem 5.1.5]. Moreover, {\og~ q k}% x L 2 is a 3-monotone sequence. Therefore 
by Theorem 2.1 and Lemma 2.1(b), we obtain for n > 1, 

x / i \ m 

E n(f)c^ < (1/ 2 )- E ™(/')l* x < 2 \] — — — ^r— — — — -r 

2 " 2 " *-^ (2m+ l)iog i [{2m + l){n+ 1)\ 

< 21og - «(n + l). (2.3) 

Note that in case q G (0, 1] the Fourier series for / is conditionally convergent. 
In addition, we remark that an estimate 

E n (f)c^< Clog- q (n + l) (2.4) 

can be deduced from Lemma 1(c) as well. Indeed, setting A k := log" 9 k, k > 2, we 
have for n > 2 

oo oo 

^ k\A k+n - 2A k+n+ i + A k+n+2 \ = ^2 (k - n)\A k - 2A k+1 + A k+2 \ 

A;— n+l k— n+l 

oo oo 



< J2 k max \d 2 (log- q y)/dy 2 \ <C ^ fc -1 log -(<?+1) k 

A;— n+l — — k— n+l 

<Clog - V+l). 
This implies (2.4). 

Example 2.2. f(x) = £^Li k x exp(-Ak a ) coskx, X e R, A > 0, a > 0. Then 
f'(x) — -~ £fe^Li & A+1 exp(— ^4fc Q )sinA:a; e Ljjw Since for n large enough, the 
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sequence {fc A+1 exp(— A k a )}^L n+1 is 2-monotone for n > no, we obtain from The- 
orem 2.1 and Lemma 2.1(a) for n > uq 

E n (f)c L < {l/2)E n {f) LU = 2(n + 1) A+1 

x ( cxp(-A (n + 1) Q ) + ]T (2m + 1) A cxp(-,4 (2m + l)"(n + 1)") J 

< 2(n+l) A+1 

x (cxp(-yl(n + l) a )+ / (2y+l) A cxp(-^(2y+l) Q (n+l)")dy 



< 2(l + Cn~ a )(n+l) A+1 cxp(-A(n+l) Q ) 

< C(n+l) A+1 cxp(-A(n+l) Q ). (2.5) 

3. Lower Estimates 

Efficient lower estimates for E n (f)c* , where / is an odd or even function with 
monotone Fourier coefficients, are given in the following lemma. 

Lemma 3.1. (a) For n > 1 and f{x) = a$/2 + J^fcLi a k cos kx G C^, where 
{ a k}Z= n +i * s a P os iti ve non-increasing sequence, the following inequality holds: 

E n {f)a >(l/4)sup(iY+l) aA r + „. (3.1) 

New 

(b) For n > 1 and f(x) = '^2 < ^' =1 bkSmkx G C^, where {pk}^— n +\ is a positive 
non-increasing sequence, the following inequality holds: 

E n {f) Ci >(l/4)sup(JV + l)6 JV+n . (3.2) 

JV6N 

Inequalities (3.1) and (3.2) follow from more general estimates obtained by New- 
man and Rivlin [7] and the author [3] , respectively. Lower estimates for other classes 
of functions were developed in [4]. 

Lower estimates for E n (f)c* , where a function / of bounded variation satisfies 
some additional conditions, are based on the following theorem, which is interesting 
in itself. 

Theorem 3.1. Let f G C% n be an absolutely continuous function with f G L^. 
In addition, we assume that if Q n G T n is a polynomial of best mean approximation 
to f , then f — Q n =/= a. e. on M. and there exists f3 G [0,27r) such that f — Q n 
has exactly 2(n + 1) sign changes on [/3, 2tt + [3). Then 

E„(f)c L > [4(n+ l)]- 1 ^/')^- (3-3) 

Equality in (3.3) holds for the function 

4 ^ cos[(2m+l)(n+l)g] 

Mx) = ~^+i) 2. ^rip • (3 ' 4) 

Proof. Let Q n G T n be a polynomial of best approximation to /' in L% n and let 
x\, . . . , X2(n+i) be the points from (/3, 2ir + /?), in which /' — Q n changes its sign. 
Setting Xq := (3, £2(n+i)+i := 2n + f3 and using the criterion of best approximation 
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of /' in L^ [8, Section 2.8.1], we have for any polynomial T n e T n 

[2ir+f3 2(n+l) + 1 

= / T n (x)sgn(f(x)-Q n (x))dx = ± V (-1) 4 / T n (x)dx 

(2n+l rXi+i r-2ir+l3 \ 

2 S (_1)4+1 / ^M** 5 - / r„(ar)da; . (3.5) 

Next, the following trivial relation holds for any constant C: 

2n+l 

£(-ir +1 C = 0. (3.6) 

i=0 

Further applying (3.5) and (3.6) to T n (x) — sinfcx or T n (x) — coskx, 1 < k < n, 
we obtain that for any T* (x) = X)fc=i( afc cos ^ x + ^fc sm ^• z -)j 

2n+l 

^(-ir +1 T,:( a ; 4+1 ) = 0. (3.7) 

»=0 

It follows from (3.6) that (3.7) is valid for any T* £ T n . Let us define a measure /z 
with its support on {x\, . . . , X2( n +i)} by 

At(x i+ i) = 2(-l) i+1 , 0<i<2n+l. (3.8) 

Then \x is orthogonal to T n since for any T* G 7^, (3.7) is equivalent to the relation 

l-2-n+fi 

/ T:(x)d^( X ) = 0. (3.9) 

Therefore taking account of the facts that /' — Q n ^ a. e. on R and /' — Q„ 
has exactly 2(n + 1) sign changes on [p,2ir + (3), we get from (3.5), (3.6) (3.8), and 
(3.9) that 

r2TT+P 

E n (f')L ilr = / (f(x) - Q n (x))sgn(f'(x) - Q n (x))dx 

2(n+l) Xi+1 r 2TT+fJ 



± I 2 ^ (-l) i+1 y (f(x)-Q n (x))dx- J (f'(x)-Q n (x))dx 

<v Wl iE n (f) c *, (3.10) 



//3 J/3 

2(n+l) 



= ±2^ ("l) i+1 /(^ + i 
where 



27T + /3 

f(x)d(j,{x) 





varit:=/ |/z(a;)| = 4(n+ 1). (3.11) 

Therefore (3.3) follows from (3.10) and (3.11). To complete the proof, we note that 
the function /„ defined by (3.4) is continuous and ,f' n (x) — sgnsin[(n+ l)x] € L^- 
Since J f^(x)T*(x)dx = for any T* e 7^, we conclude that Q„(x) — is a 
polynomial of best mean approximation to /^. Moreover for < (3 < ir/(2n + 2), 
the function f' n —Q n = f' n has exactly 2(n+l) sign changes on (/?, 27T+/3). Therefore 
/„ satisfies all conditions of Theorem 3.1. Finally we have 

E n {f n ) LU = f"\ftix)\dx = 2n, (3.12) 

Jo 
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and by Chebyshev's theorem, 

Enifnhi, = \\fn\\c L = 7r/(277 + 2), (3.13) 

since for k = 0, 1, . . . , 2n + 1, 

/_fc7r_\ fc+1 4 ^ i = ( _ 1)k+ i * 

Jn \n+l) { ' 7^77+1)^(2777+1)2 l ' 2(77+1)" 

Thus (3.12) and (313) imply 

Sn(.f„)c L = [4(77+l)]- 1 S„(/;) L|x . 

This completes the proof of the theorem. □ 



The following corollary shows that estimate (3.3) holds for functions satisfying 
Nagy's conditions. 

Corollary 3.1. If the Fourier coefficients of a function g = f E L* 2lx satisfy 
conditions of Lemma 2.1(a) or Lemma 2.1(b), then inequality (3.3) holds. 

Proof. Nagy [6] (see also [8, Sections 2.11.5 and 2.13.32]) proved that if conditions 
of Lemma 1(a) or Lemma 1(b) arc satisfied for g{x) = f'(x) = X^fe=i -Bfcsinfcx or 
g(x) = f'(x) = Aq/2 + Y^k=i Ai-coskx, then sin(n + l)x(f'(x) — Q n (x)) > or 
cos(n + l)x (f'(x) — Q n {x)) > and /' — Q n + 1 a. c. on R. Here Q n is a polyno- 
mial of best mean approximation to /'. Therefore the conditions of Theorem 3.1 
are satisfied and (3.3) follows. □ 



Lower estimates of Lemma 3.1 and Corollary 3.1 are used in the following ex- 
amples. 

Example 3.1. f{x) = J2^L 2 klo k *k ' 1 > ®- Then using Lemma 3.1(b), we have 

> 1 N + 1 1 C 

n(/)c 2 -„ > ~ A sup (iv + n)log9(7V + n) - 8 i og 9( 2n ) - log^n+l)" 

Example 3.2. f(x) = YX=i fcA exp(-Ak a ) coskx, A E K, A > 0, a > 0. Then 
for 77 large enough, the Fourier coefficients of /' satisfy the conditions of Lemma 
2.1(b). Therefore by Corollary 3.1, 

E n (f)c; L > [4(n + l)]- 1 ^/')^ - (l/2)(n+ 1) A 
x ( exp(-A (77 + 1) Q ) + J2 ( 2m + !) A CX P(-^ ( 2 ™ + !)"(" + !)") J 

V m=0 / 

> C(77 + 1) A CXp(-^(77 + 1)"). 

Example 3.3. f(x) = Y^kLi k~ x coskx, A > 1. Then by Lemma 3.1(a), 

E n (f)c L > \ sup ( nl\ > C(n+1)-^. (3.14) 

4 nen (N + n) A 

In the following corollary, we combine upper and lower estimates of Examples 2.1, 
2.2, 3.1, and 3.2. 
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Corollary 3.2. (a) For f(x) = £~ 2 ™jfc, q > 0, 

C 2 log- q (n+l) < E n (f) CL < G\log- q (n+l). 
(b) For f(x) = YX=i fcA exp(-A k a ) cos h, A e K, A > 0, a > 0, 

n — >oo ^ 
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Abstract. Suppose that E is a real normed linear space, C is a nonempty convex 
subset of E and T : C — ► C is a Lipschitz generalized 5>-hemicontractive mapping. 
Under suitable conditions on the iterative parameters, we show that the Mann iter- 
ative sequence with errors converges strongly to the unique fixed point of T. 

Key Words: 0-hemi contractive mapping; generalized <3?-hemi contractive mapping; 
Mann iterative sequence with errors; fixed point 
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1 INTRODUCTION 

Let E be an arbitrary real normed linear space with dual space E* and C be a nonempty 
subset of E. We denote by J the normalized duality mapping from E to 2 defined by 

J[x) = {x* £ E* : (x,x*) = \\x\\ 2 = \\x*\\ 2 }, VieB, 

where (•, •) denotes the generalized duality pairing. 

A mapping T : C — ► E is called (ft-hemicontractive if the fixed point set F(T) = {x £ C : 
Tx = x} is nonempty and there exists a strictly increasing function <p : [0, oo) — ► [0, oo) with 
(f>(0) = such that, for all x £ C and x* £ F(T), there exists j(x — x*) G J(x — x*) satisfying 

(Tx — x*,j(x — x*)} < \\x — a;* || — <fi(\\x — x*||)||x — x*||. 

T is called generalized &-hemicontractive if the fixed point set F(T) is nonempty and there 
exists a strictly increasing function & : [0, oo) — ► [0, oo) with ^(0) = such that 

(Tx - x*,j(x - x*)) < \\x - x*\\ 2 - $(||x - a5*||) (1.1) 

holds for all x £ C, x* G F(T) and for some j(x — x*) £ J(x — x*). 

Generalized ^-hemi contractive mapping is also called uniformly hemi contractive in [1, 2]. 
It is well known that this kind of mappings play important roles in nonlinear analysis. 



*This research is supported by the Education Committee project Research Foundation of Chongqing (Grant 
No. KJ070806) and Chongqing Key Laboratory of Operations Research and System Engineering. 

'E-mail address: xch@cqnu.edu.cn (C.H. Xiang), zhe505@yahoo.com.cn (Z. Chen), kequanz@163.com (K.Q. 
Zhao). 
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By taking 3>(s) = scf)(s), where 4> : [0,oo) — ► [0, oo) is a strictly increasing function with 
4>(0) = 0, we know that the class of i^-hemicontractive mappings is a subset of the class of 
generalized <J>-hemicontr active mappings. The Example 1.1 below demonstrates that the class 
of Lipschitz 0-hemicontractive mappings is a proper subset of the class of Lipschitz generalized 
<J>-hemicontractive mappings. 

Example 1.1. Let E = K be the reals with the usual norm. Define T : E — > E by 

Tx = x k, V x € E. 

1 + x z 

Then T is Lipschitz and T has the unique fixed point x* = G E. Let 3> : [0, oo) — ► [0, oo) 

2 

be defined by 3>(s) = ttt^- Then 3>(s) is a strictly increasing function with ^(0) = 0. For all 
x £ E, we have 

(Tx-Tx*,x-x*) = (Tx,x) = x 2 ^— ;, = |x| 2 -$(|x|) = Is - x*| 2 - $(|x - x*\), 

1 + x z 

so that T is a Lipschitz generalized <J>-hemicontractive mapping. Since 

(Tx — Tx* ,x — x*} = \x — x*\ — 3>(|x — x*|) < |x — x*\ — <fi(\x — x*\)\x — x*\ 

holds for all x £ E if and only if <fi(s) < t^^-j for all s > 0, we know that T is not </>- 
hemicontractive since such a function <f> is not increasing. 

Many results have been proved on convergence or stability of Ishikawa iterative sequences 
(with errors) or Mann iterative sequences (with errors) for Lipschitz ^-hemicontractive map- 
pings (see, e.g., [3-8] and the references therein). Recently, there are some authors studied the 
convergence of the iterative sequences (with errors) involving generalized ^-hemicontractive 
mappings (see, e.g., [1,2,9-11] and the references therein). In 2005, Chidume and Chidume [11] 
proved the following results: 

Theorem CC1 [11, Theorem 2.2]. Let E be a real normed linear space and T : E — > E 
be uniformly continuous. Let {x n } be a sequence in E defined iteratively from an arbitrary 
x G E by 

x n -\-i — ci n x n T" o n i x n T" c n u n , n ^_ u, 

where {a n }, {b n }, {c n } are sequences in [0, 1] satisfying the following conditions: 
(i) a n + b n + c n = l, V n > 0; 

oo 

(ii) E ( fo n + c n ) = oo; 

n=0 

oo 

(iii) J2 (bn + c n ) 2 < oo; 

n=0 

oo 

(iv) J2 c n < oo and such that 

n=0 

(Tx„ - x*,j(x n - x*)) < \\x n - x*\\ 2 - ®(\\x n - x*\\), V n > 0, 

where $ : [0, oo) — ► [0, oo) is a strictly increasing function with $(0) = 0, and {u ra } is a 
bounded sequence in E. Then {x n } is bounded. 

Theorem CC2 [11, Theorem 2.3]. Let E be a real normed linear space, K be a nonempty 
subset of £7 and T : .fT — ► E be a uniformly continuous generalized ^-hemicontractive mapping, 
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i.e., there exist x* £ F(T) and a strictly increasing function <£ : [0, oo) — ► [0,oo), ^(0) = 
such that for all x £ K, there exists j(x — x*) £ J(x — x*) such that 

(Tx n - x*,j(x n - x*)) < \\x n - x*\\ 2 - ®(\\x n - ic* ||), V n > 0. 

(a) If y* G K is a fixed point of T, then y* = x* and so T has at most one fixed point in K. 

(b) Suppose there exists xq G K, such that the sequence {x n } defined by 

X n -\-l — (l n X n T" n l X n T" C n U n: Tl ^_ U, 

is contained in i'T, where {a n }, {b n } and {c n } are real sequences satisfying the following 
conditions: 

(i) a n + b n + c n = 1; 

oo 

(ii) E {K + c n ) = oo; 

n=0 

oo 

(iii) J2 ( b n + c„) 2 < oo; 

n=0 

oo 

(iv) J2 c n < oo; and {u n } is a bounded sequence in .E. 

n=0 

Then {x n } converges strongly to x* . In particular, if y* is a fixed point of T in X, then {x n } 
converges strongly to y* . 

The proof of conclusion (b) in Theorem CC2 is based on Theorem CC1. However, there is 
a gap in the proof of Theorem CC1. In fact, in the proof of Theorem CC1, in order to prove 
||x n — x*|| < 2<I> _1 (ao) for all n > by induction, where ao is a positive number, by assuming 
that \\x n — x*\\ < 2<£ _1 (ao) and ||x n +i — x*|| > 2<& -1 (ao) hold for some n, the authors of [11] 
established following inequality (see [11, page 551]) 

||x n+ i - x*|| 2 < \\x n - x*\\ 2 - a„$(2$- 1 (a )) + M x o? n + c n p (1.2) 

for the same n. Unfortunately, (1.2) does not imply that 

n n oo n 

$(2$~ 1 (a ))^a j < Y, (\\xj-x*\\ 2 -\\x j+1 -x*\\ 2 ^ + M^u] + pJ2 c i> 

j=0 n=0 j=0 j=0 

since (1.2) holds only for one given natural number n. Therefore, the result of Theorem CC2 
may be not true since its proof is based on Theorem CC1. 

On the other hand, it has been proved in [6] that if T : K — ► K is a Lipschitz <p- 
hemi contractive mapping and c n = for all n > 0, then the conclusion (b) in Theorem CC2 
holds. Since the class of Lipschitz 0-hemicontractive mappings is a proper subset of the class 
of Lipschitz generalized <I>-hemi contractive mappings, this leads to the following questions. 

Question 1. Suppose that T : C — > C is a Lipschitz generalized $-hemi contractive 
mapping. Does the result in [6] hold? 

Question 2. Suppose that T : C — ► C is a Lipschitz generalized <3?-hemi contractive 
mapping. Does the conclusion (b) of Theorem CC2 hold? 

The main purpose of this paper is to give affirmative answer to Questions 1 and 2. 
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2 PRELIMINARIES 

The following lemmas will be used in the proof of our main results. 

Lemma 2.1 (See, e.g., [11]). Let E be a real normed linear space. Then for all x,y £ E, 
we have 

lk + y|| 2 < \\x\\ 2 +2(y,j(x + y)), \fj(x + y) G J(x + y). 

Lemma 2.2 (See, e.g., [12]). Let {a n }, {b n }, {c n } be three nonnegative sequences satisfying 
the following condition: 

a n +i < (1 + b n )a n + c„, Vn> n , 

oo oo 

where no is some nonnegative integer, J2 b n < oo and J2 c n < oo. Then the limit lim a n 

n=no n=no n— »oo 

exists. 

Lemma 2.3. Suppose that there exists a natural number uq such that a n , b n , c n and (5 n 
are nonnegative real numbers for all n > no satisfying the following conditions: 

(i) a n+1 < (1 + 6 n )a n - /3 n <£>(a n +i) + c n , V n>n , 

oo oo 

(h) J2 b n < OO, ^ C n < OO, 
n=no n=no 

oo 

(iii) X) /3„ = oo, 

n=no 

where <£ : [0, oo) — > [0, oo) is a strictly increasing function with y(0) = 0. Then lim a n = 0. 
Proof. By condition (i), we have 

«n+i < (1 + b n )a n + Cn, V n > n . 
Using condition (ii) and Lemma 2.2, we obtain that lim a n exists and so \a n } is bounded. 

n^oo 

Suppose that lim a n = a and a n < M (V n > no), where a, M are nonnegative constants. 

ra— »oo 

oo 

Let d n = Mb n + c n . Then Yl d n < oo by condition (ii). It follows from condition (i) that 

a n +i <a n - (3 n (p(a n+ i) + d„ ( V n > n ). (2.1) 

Now, we prove that a = 0. If a > 0, then there exists a nonnegative integer N > uq such that 
a«+i > | for all n > N. Since 92 is strictly increasing, we have V2(a n+ i) > y(|) > for all 
n > N. It follows from (2.1) that a n+ \ < a n — /3 n <^(|) + d n (\/n> N) and so 

00 00 

00 = ^(o^ ^ ^ n - aN + ^ dn < °°' 

which is a contradiction. Therefore, lim a n = a = 0. This completes the proof. □ 

n-^00 

Remark 2.1. Lemma 2.3 is different from Lemma 3 in [13], which requires that b n = 
for all n > and c n = o{(3 n ). 
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3 MAIN RESULTS 

Theorem 3.1. Let E be a real normed linear space, C be a nonempty convex subset of 
E and T : C — ► C be a Lipschitz generalized 5>-hemicontractive mapping. For given xo £ C, 
suppose that the sequence {x n } C C is the Mann iterative sequence with errors defined by 

x n+1 = a n x n + (3 n Tx n + 7„n n , n > 0, (3.1) 

where {u n } is a bounded sequence in C and {a n }, {/3 n }j {in} are sequences in [0, 1] satisfying 
the following conditions: 

(1) a n + /3 n + 7 „ = l(Vn>0); 

oo 

(2) £ /?„ = oo; 

n=0 

oo oo 

(3) £ /£ < OO, £ 7n < OO. 
n=0 n=0 

Then {x n } converges strongly to the unique fixed point of T in C. 

Proof. It follows from (1.1) that F(T) = {x G C : Tx = x} is singleton. Let F(T) = {p} 
and M = sup{||u n — p\\ : n> 0}. Then M < oo and 

\\u n - x n \\ < \\u n -p\\ + \\p- x n \\ < M + \\x n -p\\. (3.2) 

By (1.1), there exists j(x n — p) £ J(x n — p) such that 

{Tx n -p,j(x n -p)) < \\x n -p\\ 2 - $(||x n -p\\), Vn > 0, (3.3) 

where $ : [0, oo) — ► [0, oo) is a strictly increasing function with ^(0) = 0. Let L be the 
Lipschitz constant of T. By Lemma 1.1, (3.1) and (3.3), we obtain 

\\ x n+i-p\\ 2 = \\a n (x n -p) + (3 n {Tx n -p) +7„(u„ -p)\\ 2 

< a 2 n \\x n -p\\ 2 + 2f3 n {Tx n -p,j(x n+1 - p)) 
+2~/ n (u n -p,j(x n+ i -p)) 

< a 2 n \\x n -p\\ 2 + 2(3 n (Tx n+1 -p,j(x n+ i - p)) 
+2{3 n (Tx n - Tx n+1 , j(x n+1 -p)) +27„M||x n+ i - p\\ 

< a 2 n \\x n -p\\ 2 + 2[i n [||x n+ i-p|| 2 -$(||x n+ i -p\\)] 

+2j3 n L\\x n -x n+ i|| • ||x n+ i -p\\ +27„M||x„ + i - p\\, V n > 0. (3.4) 

From (3.1) and condition (1), we have 

%n %n+\ — Pn\%n J- %n) Iny^n X n ). 

It follows from (3.2) that 

||x n -x n+ i|| < (3 n \\x n -Tx n \\ +7„(M+ \\x n -p\\). (3.5) 

Observe that 

\\x n -Tx n \\ < \\x n -p\\ + ||p-Tx n || < (1 + L)\\x n - p\\. (3.6) 

Taking (3.6) into (3.5), we obtain 

\\x n -x n+ i|| < [(1 + L)/3 n + 7„] \\x n -p\\ +M7„. (3.7) 
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Taking (3.7) into (3.4), we have 

||x n+ i-p|| 2 < al\\x n -p\\ 2 + 2f3 n [\\x n+1 - p\\ 2 - <S>(\\x n+1 - p\\)] 

+2(5 n \\x n -p\\ + a n )\\x n+1 -p\\, (3.8) 

where S n = L(5 n [(1 + L)0 n + j n ] , a n = (Lf3 n + l)Mj n , V n > 0. By condition (3), we have 

oo oo 

Y^ °~n < OO, Yl ° n < °°- ( 3 - 9 ) 

n=0 n=0 

Denote a n = \\x n — p\\ 2 ( V n > 0) and y(s) = 2<£(y / s). It follows from (3.8) that 

a n +\ < a n a n + 2(3 n a n +i - 0n^>{a n +i) + 2(5 n y/a n ~ + (T n )y/On+i, V n > 0. 

Noting that < a n < 1 — (3 n , we obtain 

fln+i < (1 -/?n) 2 a„ + 2/3 n a„+i - (3 n (p(a n+1 ) + 5 n (a n + a n+ i) + a n (l + a n+ i) 

= (1 -2f3 n + P 2 . + 5 n )a n + (2(3 n + 5 n + a n )a n+ i- p n f(a n+ i) +a n . (3.10) 

It follows from (3.9) and condition (3) that lim (2(3 n + 5 n + o~ n ) = 0. Thus, there exists a 

n — >oo 

natural number no such that 2(3 n + 5 n + a n < \ for all n > no- Let 

_ 1 - 2[3 n + /g + 5 n /?* + 25 n + o n 

On — - ~ „ r- ' 



J- ^Pn n 0~n *■ ^Pn On @n 

<?n 

1 — 2(3 n — 5 n — <Tn 

By (3.10), 

a n +\ < (1 + &„)a n - /3„^(a„ + i) + c„, V n > n . 

Since ^ < 1 — 2/3 n — <5 n — cr n < 1 for all n > no, 

< b n < 2(01 + 25 n + o-„), < Cn < 2o n ,Vn > n . 



It follows from (3.9) and condition (3) that £ ° n < oo and J2 c n < oo. Therefore, by 

n=no n=no 

using Lemma 2.3 and condition (2), we obtain that lim \\x n — p\\ 2 = lim a n = 0. That is, 

n— »oo n— »oo 

lim ||x ra — p\\ = 0. This completes the proof of Theorem 3.1. □ 

n— +oo 

Remark 3.1. Theorem 3.1 gives an affirmative answer to Question 2. 

Taking 7„ = for all n > in Theorem 3.1, we have the following result. 

Theorem 3.2. Let E be a real normed linear space, C be a nonempty convex subset of 
E and T : C — ► C be a Lipschitz generalized <I>-hemicontractive mapping. For given xo £ C, 
suppose that the sequence {x n } C C is the Mann iterative sequence defined by 

X n +1 = (1 - /?n)x n + f3 n Tx n , U > 0, 

where {/?«} is a sequence in [0, 1] satisfying the following conditions: 

oo 

(1) E 0n = oo; 
n=0 

oo 

(2) £ 2 n < oo. 

n=0 

Then {x n } converges strongly to the unique fixed point of T in C. 

Remark 3.2. Theorem 3.2 gives an affirmative answer to Question 1. 
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Abstract. In this paper, strong convergence of Modified Mann type iteration 
is used to find some common fixed point of a countable family {Tn}„^i of 
noncxpansive mappings in Banach space, and their proof is different from ones 
of Aoyama ct al. [Approximation of common fixed points of a countable family 
of nonexpansive mappings in a Banach space, Nonlinear Analysis, 67(2007) 
2350-2360] and other existing results which is independent of the convergence 
of the implicit anchor- like continuous path zt, defined by zt = tu + (1 — i)Tzt- 

Key Words and Phrases: Countable family of noncxpansive mappings; 
Modified Mann type iteration; Mann type iteration; uniformly Gateaux diffcr- 
cntiablc. 



1. Introduction 

Let E be a Banach space and E* be its dual space. Let K be a nonempty closed 
convex subset of E and T : K — > K be a mapping. T is said to be non-expansive 
if \\Tx-Ty\\ < \\x-y\\ for all x, ye K. 

In order to find a fixed point of noncxpansive mapping T, Mann [14] and Halpcrn 
[7] respectively introduced the iteration procedure in a Hilbert space as follows (We 
refer them to as Mann iteration and Halpern iteration): 

x n+i = (1 - a n )Tx n + a n x n (1.1) 

and 

x n +i = (1 - a n )Tx n + a n u, (1.2) 

where {a n } is a sequences in [0,1]. Subsequently, Mann iteration and Halpern 
iteration ware studied extensively over the last twenty years for constructions of 
fixed points of nonlinear mappings and of solutions of nonlinear operator equations 
involving monotone, accretive and pseudocontractive operators. For example, [3, 
4, 11, 12, 13, 16, 19, 20, 21, 26, 27, 28, 31, 32, 33, 35, 36] and many other results 
which isn't mentioned here. 

The modified version of Mann iteration and Halpern iteration were investigated 
widely by many mathematic workers. For example, Kim-Xu [10] and Chidumc- 
Chidume [5] dealt with the strong convergence of the following iterative scheme 
(so-called Modified Mann iteration) for a non-expansive mapping T: for x , u G K, 

Xn+i = a n u + (1 - a n )(/3 n x n + (1 - l3 n )Tx n ). (1.3) 
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where a n ,/3 n G [0,1]. Song-Chen [25] researched strong convergence of Modified 
Mann iteration (1.3) in the frame of reflexive Banach space which is complementary 
and development of the above results. 

Recently, for a nonexpansive mappings sequence {T„}„^ with some special con- 
dition, Jung [9] and O'Hara et al. [17, 18] respectively studied strong convergence 
of the following iteration: for xo, u G K, 

x n+1 = a n u + (1 - a n )T n x n , (1.4) 

where a n € [0, 1] such that (CI) linin^oo a n = and (C2)^ n=1 a n = oo. Unfortu- 
nately, there was a gap in the proof lines of their main results. With the purpose of 
overcoming the gap, Song-Chen [29, 30] introduced the conception of a uniformly 
asymptotically regular for {T„}„^ an d proved several strong convergence results 
by using the conception. Other investigation of approximating common fixed point 
for countable family of nonexpansive mappings by means of the iteration (1.4) can 
be found in Refs [1, 22, 24] and many results which isn't cited here. 

Very recently, still for a nonexpansive mappings sequence {T n }^f^ with some 
specific condition, Aoyama et al. [2] introduced Mann type iteration procedure: let 
X\ G K and 

x n +i = a n x n + (1 - a n )T n x n , (1.5) 

where a n G [a, b] C [0, 1], and showed its strong and weak convergence in uniformly 
convex Banach space. At the same time, Song [23] also carefully researched the 
convergence of the iteration (1.5) by the aid of the uniformly asymptotically regular 
of {T n }^=i in a reflexive Banach space. 

In this paper, for a countable family {T„}„^ of nonexpansive mappings with 
some appropriate condition (see section 3) , we will introduce the following iteration 
procedure: let x\, u G K and 

x n+ i = a n u + (i n x n + (1 - a„ - /3 n )T n x n , (1.6) 

and show its strong convergence in Banach space when a n ,[3 n G [0, 1] satisfy the 

oo 

conditions lim a n — 0, ~}2 a n = oo and < liminf j3 n < lim sup /3„ < 1. We also 

n^oo n=Q n^oo ' n^oo 

go on exploring the weak convergence of the Mann type iteration (1.5) under the 
condition < lim inf a n < lim sup a n < 1 . 

n >oo n — >QO 

2. Preliminaries and basic results 

Throughout this paper, the fixed point set of T is denoted by F(T) := {x e 
K; Tx = x}. Let E be a real Banach space and let J denote the normalized duality 
mapping from E into 2 E given by 

J(x) = {/ G E*, (x, f) = \\x\\\\fU\x\\ = ||/||},V xeE, 

where E* is the dual space of E and (•, •) denotes the generalized duality pairing. 
We write x n — ^ x (respectively x n —^ x) to indicate that the sequence x n weakly 
(respectively weak*) converges to x; as usual x n — > x will symbolize strong conver- 
gence. In order to show our main results, the following conceptions and lemmas 
are needed. 

Let S(E) := {x e E; \\x\\ = 1} denote the unit sphere of a Banach space E. A 
Banach space E is said to have 
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(i) a Gateaux differentiable norm (we also say that E is smooth), if the limit 

lim "g + frll-M (*) 

t-o t 

exists for each x, y G S(E); 

(ii) a uniformly Gateaux differentiable norm, if for each y in S(E), the limit (*) 
is uniformly attained for x G S(E); 

(iii) a Frechet differentiable norm, if for each a; G S(E), the limit (*) is attained 
uniformly for y G S(E); 

(iv) a uniformly Frechet differentiable norm (we also say that E is uniformly 
smooth), if the limit (*) is attained uniformly for (x,y) G S(E) x S(E). 

(v) /jrred point property for noncxpansivc self-mappings , if each noncxpansive 
self-mapping defined on any bounded closed convex subset K of E has at least a 
fixed point. 

A Banach space E is said to be (vi) strictly convex if 

x ~\- uW 
IMI = \\y\\ = 1, £ 7^ y implies < 1; 

(vii) uniformly convex if for all e G [0, 2], 3<5 £ > such that 

x ~t~ ?y 
||x|| = ||y|| = 1 implies < 1 — S e whenever \\x — y\\ > e. 

(viii) The subset K of E is a Chebyshev set, if V:r G 25, there exactly exists unique 
element y G K such that ||x — y|j = d(x, K) = inf{||x — z\\; z G if}. 

The following results is well known which are found in reference[8, 34, 15]: the 
normalized duality mapping J in a Banach space E with a uniformly Gateaux dif- 
ferentiable norm is single-valued and strong-weak* uniformly continuous on any 
bounded subset of E; each uniformly convex Banach space E is reflexive and 
strictly convex and has fixed point property for noncxpansive self-mappings; ev- 
ery uniformly smooth Banach space E is a reflexive Banach space with a uniformly 
Gateaux differentiable norm and has fixed point property for nonexpansive self- 
mappings; every nonempty closed convex subset is a Chebyshev set in strictly con- 
vex and reflexive Banach space (see [15, Corollary 5.1.19]); Each weakly compact 
convex subset is a Chebyshev set in strictly convex Banach space E (see [8, Lemma 
9.3.7]). 

Lemma 2.1 ([33, Lemma 2.2]) Let {x n } and {y n } be two bounded sequences in 
a Banach space E and f3 n G [0,1] with < liminf/3„ < lim sup (3 n < 1. Suppose 

n >oo n — >QC 

x n+ \ = f3 n x n + (1 — f3 n )y n for all integers n > 1 and 

limsup(||y n+ i - y n \\ - \\x n+i - x n \\) < 0. 



Then lim \\x n — y n \\ = 0. 

n — >oo 

Lemma 2.2 (see [8, Lemma 9.3.6]) Let C be a weakly compact subset in Banach 
space E and let f : E — > R be a weakly lower semi-continuous function. Then the 
inflmum of f is achieved in C. 

In the proof of our main theorems, we also need the following definitions and 
results. Let /ibea continuous linear functional on ^°° satisfying ||^|| = 1 = /x(l). 
Then we know that /i is a mean on TV if and only if 

inf{a„; n G N} < /x(a) < sup{a n ; n G N} 
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for every a = (ai, 02, • • • ) € ^°°. According to time and circumstances, we /Lt n (a„) 
instead of fi(a). A mean fi on N is called a Banach limit if 

Mij(flre) = fn(flti+l) 

for every a = (01, 02, • • • ) € ^°°. Furthermore, we know the following results [35, 34]. 

Lemma 2.3 ([35, Lemma 1]) Let C be a nonempty closed convex subset of 
Banach space E with uniformly Gateaux differentiable norm. Let {x n } be a bounded 
sequence of E and let /j, n be a mean // on N and z G C. Then 

"n\\x n - A? = min/i„||a;„ - y\\ 2 

y£C 

if and only if 

n n (y - z, J(x n - z)) < 0, \fy E C. 



Lemma 2.4 ([32, Proposition 2]) Let a is a real number and (xo,xi, . . .) S l°° 
such that fJ, n x n < 01 for all Banach Limits. Lf limsup(x„ + i — x n ) < 0, then 

n — >oo 

lim sup x„ < a. 

n — >oo 

Lemma 2.5 ([36]) Let {a n } be a sequence of nonnegative real numbers satisfying 
the property 

a n +i < (1 - ln)a n + JnPn, n > 0, 

where {"/ n } C (0, 1) and {[3 n } C M. such that 
00 
(i) J2 In = 00; (ii) limsup/3„ < 0. 

n=0 n^oo 

Then {a n } converges to zero, as n — > 00. 

3. Strong convergence of the modified Mann type iteration 

Let K be a nonempty closed convex subset of Banach space E. Suppose {T n } (n = 

1 , 2, . . .) is a countable family of noncxpansivc mappings from K into itself such that 

00 
F := P| F(T n ) y^ 0. Recently, in uniformly convex Banach space, Aoyama et al. [1] 

n=l 

obtained the strong convergence of modified Halpern type iteration (1.4) if {T n }~^°? i 

and {a„}+f^ C (0, 1] satisfy the following conditions: 
00 
(Bl) ^2 sup ||T„ + ix — T n a;|| < +00 for any bounded subset C of K; 

n=0xeC 

00 
(CI) lim a n — and J^ a n = 00; 

00 
(C2) either lim °" +1 = 1 or ^] \a n +i — oc n \ < +00. 

n ^°° "" n=0 

Their proof also depend on the following important fact((Bl) implies (B2), see 
[1, Lemma 3.2]): 

(B2) for any bounded subset C of K, there exists a noncxpansivc mapping T of 
K into itself such that 

lim sup \\Tx- T n x\\ = and F(T) = F. 
n ^°°xec 
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In this section, wc introduce the following modified Mann type iteration: for 
Xi,u G K, 

x n +i = a n u + p n x n + (1 - a n - [3 n )T n x n , (3.1) 

and show its strong convergence under the conditions (CI) and (B2) along with 

(C3) < liminf (3 n < limsup/3„ < 1. 

n >oo n — >00 

With the purpose of proving main results, we first show the following lemma 
which doesn't depend on the convergence of the implicit anchor-like continuous 
path Zt = tu + (1 — t)Tz t as many existent results (see [5, 7, 10, 32, 35, 36] and so 



Lemma 3.1 Let K be either a nonempty weakly compact convex subset of a 
strictly convex Banach space E or a nonempty closed convex subset of a reflexive 
Banach space E with fixed point property for nonexpansive self-mappings. Assume 
that T : K — > K is a nonexpansive mapping with F(T) ^ and a bounded sequence 
{x n } of K satisfies 

lim ||x„ + i — Tx„|| = and lim \\x n — x n+ \\\ = 0. 

n — >oo n — >oo 

Suppose that E has a uniformly Gateaux differ entiable norm. Then there exists 
x* G F(T) such that 

limsup(u — x* , J{x n+ i — x*)) < for each u € K. 

n— >oo 

Proof. Let 

g(x) = Hn\\x n - x|| 2 ,Va; e K. 
Then g{x) is continuous and convex on K. Define a set 

K x = {x e K;g(x) = inf ' g(y)}. 

From Lemma 2.2 or the reflexivity of E and the property of g{x) together with the 
boundedness of {x n }, we obtain K\ is a nonempty bounded closed convex subset 
of K and hence weak compact. 
For Vx e Ki, then 

g(Tx) = fi n \\x n - Tx\\ 2 = n n \\x n+ i - Tx\\ 2 

< fin(\\x n+ i - Tx n \\ + \\Tx n - Tx\\) 2 

< Li n \\x n - x\\ 2 = g{x). 

Hence, Tx e K l . Namely, T(K-l) C K x . 

Case 1. Assumed that E is strictly convex. Taking y <E F(T), then there exists 
unique x* <G K\ such that 

\\y-x*\\ = d(y,K 1 )= inf ||y-a;||. 

l£Al 

By Tx* G Ki, we have 

||y-7V|| = \\Ty-Tx*\\ < \\y-x*\\. 

Hence x* = Tx* by the uniqueness of x* in K\. 

Case 2. Assumed that E has fixed point property for nonexpansive self-mappings. 
By T{K X ) C Ki., there exists x* e K x such that x* = Tx* . 

Using Lemma 2.3 and the definition of K\, we get that for u e K, 

fi n {u - x*,J(x n - x*)} < 0. 
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On the other hand, as linin^oo ||a; n +i — x n \\ = together with the norm- weak* 
uniformly continuity of the duality mapping J in Banach space with a uniformly 
Gateaux diffcrcntiablc norm, we have 

lim ((w — x* , J(x n+ i — x*)) — (u — x* , J(x n — x*))) = 0. 

n — >oo 

Hence, the sequence {(u — x* , J(x n — x*)}} satisfies the conditions of Lemma 3.4. 
As a result, we must have 

limsup(u — x* , J{x n+ \ — x*)) < 0. 

n — >oo 

Theorem 3.2 Let E be a reflexive Banach space with a uniformly Gateaux 
differentiable norm and having fixed point property for nonexpansive self-mappings. 
Assume that K is a nonempty closed convex subset of E and {T^}^^ is a countable 

oo 

family of nonexpansive mappings from K into itself such that F := C\ F(T n ) ^ 

and the condition (B2). Let {«„} and {{3 n } be two real number sequences in 
[0, 1] satisfying (CI) and (C3), respectively. Then the modified Mann type iteration 
sequence {x n }, defined by (3.1) strongly converges to some point of F. 
Proof. At first, we show that {x n } is bounded. Taking p e F(T), we have 

\\Xn+l ~P\\ < (!-«« ~ (3n)\\TnXn ~ P\\ + Allien ~ P\\ + OL n \\u - p\\ 

< (1- a„ - p n )\\x n ~p\\ + f3»\\x n -p\\ +a„||w-p|| 

< max{||a; n -p||,||u-p||} 

< max{||.T -p\\,\\u-p\\}. 

Thus, {x n } is bounded, and hence so is {T n x n } by ||T„x„ — p\\ < \\x n — p\\- 
Next we prove that 

lim ||x n +i — Tx„|j = and lim \\x n — x n+ i\\ = 0. (3-2) 

n — >oo n — >oo 

Indeed, let A„ = tz%~ an d z n — X n u + (1 — X n )T n x n . Then 

lim A„ = and x n+1 = (3 n x n + (1 - (3 n )z n . (3.3) 

n — >oo 

Therefore, for some constant M such that M > max{||u||,sup H^a^H} and any 
bounded subset C of K containing {x n }, we have 

ll z n+l — z n\\ = \\^n+l u + (1 — ^n+lj^n+l^n+l — (^n u + (1 — A„)T ra £ ra )|| 

< |A„+i - A„|||u| + ||r„ + ix„ + i - T n x n \\ 
+A„||T„a;„|| + A n+ i||T n+ ia; n+ i|| 

< |A„+i - A„|||u| + ||T n+ ia; n+ i - T n+ ia; n || + \\T n+ ix n - Tx n \\ 
+ \\Tx n - T n x„\\ + (A„ + A„+i)M 

< ||a; n+ i - x n \\ + (|A„+i - A„| + A„ + A„ +1 )M 

+ sup ||T„ +1 a; — Tx\\ + sup \\Tx — T n x\\. 
xec xec 

Thus, by assumption (B2) and (3.3), we have 

limsup„^ 00 (||z„ + i - z„\\ - \\x n+ i -x n \\) 

< lim sup ||r n +ia; — Tx\\ + lim sup \\Tx — T n x\\ 
n ^°° xec n ^°° xec 

+ lim (|A„ +1 - A„| + A„ + A„+i)M = 0. 



651 



COUNTABLE FAMILY OF NONEXPANSIVE MAPPINGS 



By Lemma 2.1, wc obtain 

lim \\x n - z n \\ = 



n — >oo 



and hence 

lim ||x n+ i - x n \\ = lim (1 - (3 n )\\x n - z n \\ = 0. 

n — *oo n — *oo 

Since 

< \\x n+ i - x n \\ + \\x n - z n \\ + \\z n - T n x n \\ + sup \\Tx - T n x\\ 

xec 

< \\x n+ i - x n \\ + \\x n - z n \\ + X n \\u - T n x n \\ + sup \\Tx - T n x\\, 

xec 

then 

lim ||x n+ i -Tx n \\ = 0. 

n — >oo 

By Lemma 3.1, there exists x* e F (T) such that 

limsup(u — x* , J(x n+ i — x*)) < 0. (3.4) 

n^oo 

Finally we show that x n — > x*(n — ► oo). In fact, 

||a; n+ i - x*\\ 2 
= (1 -a„ - /3 n )(T n x„ -x*,J(x n+ i -x*)) + P„(x n -x*,J{x n+ \ - x*)) 
+a n (u - x*,J(x n+ i - x*)) 

< (1 Q„ g\ \\T n x n -x*\\ 2 + \\J(x n+1 -x')\\ 2 | g \\x n -x*f + \\,J{x n+1 -x*)\\ 2 

+a n (u - x*,J(x n+ i - x*)) 

I * I|2 II * ||2 

^ /1 \ \\X n —X . \\X n + i—X | / * T/ *\\ 

< (1 - &n)- 2~^ + " ^2 — + a n[U - X , J{X n+1 - X )) 

Therefore, 

||a;„ + i - x*\\ 2 < (1 - a n )\\x n - x*\\ 2 + 2a n (u- x* ,J(x n+1 - x*)). (3.5) 

By the condition (CI), now we apply Lemma 2.5 to yield 



lim \\x n — x*\\ =0. 

n — >oo 



The proof is completed. 



Using the same proof techniques as Theorem 3.2, the following is obtained easily. 
Since the proof is a repeating work, we omit it. 

Theorem 3.3 Let E be a strictly convex Banach space with a uniformly Gateaux 
differentiable norm. Assumed that K is a nonempty weakly compact convex subset 
of E and {T n }„^°i is a countable family of nonexpansive mappings from K into 

oo 

itself such that F := C] F(T n ) ^ and the condition (B2). Let {a n } and {/3 n } be 

n=l 

two real number sequence in [0, 1] satisfying (CI) and (C3), respectively. Then the 
modified Mann type iteration sequence {x n }, defined by (3.1) strongly converges to 
some point of F. 

Theorem 3.4 Let E be a reflexive and strictly convex Banach space with a 
uniformly Gateaux differentiable norm and K be a nonempty closed convex subset 
of E. Suppose that {T„}^^ is a countable family of nonexpansive mappings from 

oo 

K into itself such that F :— C\ F(T n ) ^ and the condition (B2). Let {a n } and 
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{fin} be two real number sequence in [0, 1] satisfying (CI) and (C3), respectively. 
Then the modified Mann type iteration sequence {£„}, defined by (3.1) strongly 
converges to some point of F. 

Proof. Using the same argumentation as Theorem 3.2, we can show that {x n } 
is bounded and (3.2) holds. Since every nonempty closed convex subset is a Chcby- 
shev set in a strictly convex and reflexive Banach space (sec [15, Corollary 5.1.19]), 
then the conclusion of Lemma 3.1 holds also. The desired ultimatencss is reached. 

Remark 1 (i) There are many spaces which has the fixed point property for non- 
expansive self-mappings. For example, uniformly convex Banach space, uniformly 
smooth Banach space, reflexive Banach space with normal structure, Banach space 
with Opial's condition and so on. 

(ii) We remark that Theorem 3.3 is independent of Theorem 3.2. On the one 
hand, it is easy to find examples of spaces which satisfies the fixed point property for 
nonexpansive self-mappings, which are not strictly convex. On the other hand, it 
appears to be unknown whether a weakly compact convex subset of strictly convex 
Banach space has the fixed point property for nonexpansive self-mappings. 

(iii) In the above theorems, not only the condition (B2) is weaker than (Bl), 
but also the proof is different from ones of Aoyama et al. [1] which isn't dependent 
upon the convergence of the implicit anchor-like continuous path z t , defined by 
Zt = tu + (1 — t)Tz t . 

4. Weak convergence of Mann type iteration 

Recall that A Banach space E is said to satisfy Opial's condition [16] if for any 
sequence {x n } in E, x n —*■ x (n — > oo) implies 

limsup || x n — x\\ < limsup ||x n — y||, Vy G E with x =/= y. 

n — >oo n — >oo 

Hilbert spaces and l p (l < p < oo) satisfy Opial's condition and Banach spaces with 
a weakly sequentially continuous duality mapping satisfies Opial's condition [6]. 

We now show weak convergence of Mann type iteration (1.5) which extend the 
main result of Aoyama et al.[2] from uniformly convex Banach space to reflexive 
Banach space. 

Theorem 4.1 Let E be a reflexive Banach space satisfying Opial's condition 
and K be a nonempty closed convex subset of E. Suppose {T n } (n = 1,2,...) 
is a countable family of nonexpansive mappings from K into itself satisfying the 

oo 

condition (B2) and F := f] F(T n ) ^ 0. Let {x n } be a sequence of Mann type 

n=\ 

iteration defined by (1.5) and a n G [0, 1] satisfy 

< lim inf a n < lim sup a n < 1 . 

n >oo n — >QC 

Then {x n } weakly converges to some point of F. 
Proof. Take p G F. We have 

\\x n +\ ~P\\ < (1 -a n )\\x n -p\\ +a n \\T n x n - p\\ 

< {I - a n )\\x n - p\\ + a„\\x n - p\\ 

< \\x n -p\\- 
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Then {\\x n — p\\} is a decreasing sequence, and hence linin^oo \\x n — p\\ exists for 
each p G F and {x n } is bounded. Let C be any bounded subset of K containing 
{x n }, then 

\\T n+ ix n+1 - T n x n \\ < \\T n+ ix n+ i - T n+1 x n \\ + \\T n+ ix n - Tx n \\ 

~r||^ X n J-nXn\\ 

< \\x n +i -x n \\ + sup \\T n+1 x -Tx\\ 
xec 
+ sup \\Tx — T n x||. 
xec 

It follows from the hypothesis that 

limsup(||T„ +1 a; n+1 - T n x n \\ - \\x n+1 - x n \\) 

n — >oo 

< lim sup ||T n+1 x — Tx\\ + lim sup \\Tx — T n x\\ = 0. 
By Lemma 2.1, we obtain 



lim \\x n - T n x n \\ = 0. 



Thus, we have 



\X n I X n \\ \ \\X n -LnXnW > \\J-nXn J- X n \\ 

< \\x n - T n x n \\ + lim sup \\Tx — T n x\\ = 0. 



» xec 



Hence, 



lim \\x n - Tx„\\ = 0. 



Since E is reflexive, there exists a subsequence {x„ k } of {x n } such that x nk —^ x* 
for some x* G K . Then x* G F. In fact, suppose not. Then the Opial's property 
of E implies the following: 

limsup ||x„ fc — x*|| < limsup \\x nk — Tx*\\ 

k — ^oo k — >oo 

< limsup(||x„ fc - Tx nk \\ + \\Tx nk - Tx*\\) 

k — >oo 

= limsup \\x nk - x*\\. 

k — >oo 

This gets a contradiction. Hence x* = Tx* G F. 

Next we show x n —^ x* . Suppose not. There exists another subsequence {x ni } of 
{x n } such that x n . -^i/i*. Then, we also have x = Tx. From Opial's property, 
we have 

lim \\x n — x\\ = limsup \\x ni — x\\ 

n >ao % — >oo 

< limsup ||a; ni — x*|| = limsup \\x nk — x*\\ 

i — >oo k — >oo 

< limsup \\x nk — x\\ = lim ||x n — x||. 

Which gets a contradiction. So the conclusion of the theorem follows. 

Remark 2. It isn't known whether the assumption (B2) can be displaced by 
the weaker condition (B3). 

(B3) For any bounded subset C of K, there exists a nonexpansive mapping T of 
K into itself and a subsequence {T n .} of {T„} such that 

lim sup \\Tx - T nt x\\ = and F(T) = F. 

i^ 00 xec 
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1. Introduction 



The idea of statistical convergence was introduced by Fast [2] and Schoen- 
berg [7] independently. Later on it was studied by Fridy [3], Salat [6], and 
Tripathy [8] and many others. Gokhan and Gtingor [5] defined pointwise sta- 
tistical convergence of sequences of real-valued functions. Connor [1] gave an 
extension of the notion of statistical convergence where the asymptotic density 
in replaced by a finitely additive set function \i. 

Let N be the space of natural numbers. For each E C N, let K n (E) be 
the cardinality of the set ED [0,n]. The asymptotic (or natural) density of 
E is given by 5(E) = lim "^ - whenever the limit exists. Clearly finite 

sets have zero density, 5(E C ) — S(N — E) = 1 — S(E), whenever both sides 
exist, where E c is the complement of the set E in N. We say that a real 
number sequence (x n ) is statistical convergent to t provided that for every 
e > 0, 5({k e N : \x k - £\ > e}) = or , 5{{k e N : \x k - £\ < e}) = 1 for every 
e > in which case we write st-limxfe— £. 

Let X and Y be normed spaces and B(X, Y) be the normed spaces of all 
bounded linear operators from X into Y with the usual operator norm. 

As we know, a sequence of operators T k <E B(X, Y) tends to a limit T, where 
T : X — > Y is an operator, if given e > 0, we can find an integer kg such that 

i) \\T k - T\\ < e, for all k > k , 

ii) \\T k x — Tx\\ < e, for all k > fcg and for every x G X, 
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iii)||/(T fc a;) - f(Tx)\\ < e, for all k > k , for every xG X and / G Y\ 
where Y 1 denotes the set of all bounded linear functional on Y. Then T is called 
the uniform operator limit , strong operator limit and weak operator limit of 
(T k ), respectively. It is well known that (i) =>• (ii) =>• (in). 



2. Statistical Convergence of Sequences of Operators 



Some operator sequences does not convergence in above convergence modes 
but its might converge in a weaker sense. Therefore, in the present paper, 
we introduce a notion of uniform, strong and weak statistical convergence of 
sequences of bounded linear operators. 

Throughout the paper, (T k ) will denote a sequence of operators T k G B(X, Y). 

Definition 2.1. The sequence (T k ) is said to be uniformly statistical op- 
erator convergent to T, if for every e > 0, there exists such a set E C N that 
5(E) = 1 and 3k (e) G E 3 Vfc > k and k G E, \\T k - T\\ < e, 
i.e. for every e > 0, 

limi|{fc<n: \\T k - T \\ > s}\ = 0. 

n — >oo 

st 

In this case we write st-limT^ = T or T k —> T , where T is an operator from X 
into Y . 

Definition 2.2. The sequence (T k ) is said to be strongly statistical operator 
convergent to T, if for V e > and for V x G X, there exists such a set E x C N 
that S(E X ) = 1 and 3 fc e E x 3 Vfc > k and k G E x , \\T k x - Tx\\ < e, 
i.e. for every e > and for every x G X, 
lim i|{fe <n : ||T fe a;-Ta; II > e }| =0. 

n^oo 

si 

In this case, we write st-limTfcX = Tx or T k x — > Tx on Y" for every a; G X, 
where T is an operator from X into Y. 

Definition 2.3. The sequence (T k ) is said to be weakly statistical operator 
convergent to T, if V e > 0,Va; G X and V/G Y 1 , there exists such a set 
E x f C N that S(E X f ) = 1 and 3fc e E x f 3 Vfc>fc and k e E x f , 

\f(T k x)-f(Tx)\<s, 
i.e. for every £ > and for every i£l and every / G Y' , 

lim i|{fc < n : \f(T k x) - f(Tx)\ > e }= 

n^oo 

In this case, we write w.st-hmT/ c .T = Tx or T k x -^ ' Tx on Y for every x G X, 
where T is an operator from X into Y. 

Since the proof of the following theorems is obvious, we merely state its 
and omit its. 

Theorem 2.1: Let (T k ) and (Sfc) be two sequences of operators. 
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i) If TfcX — ► Tx and S k x — » Sec on 1" for every x £ X, then aT k x + 
/35fcX — ► aTi + /3Sx on Y" for every id, where a, j3 £ K. 

ii) If T k x — > Tx on Y for every x € X, then ||Tfcx|| — > ||Tx|| for every 

x £ X, 

where /i=st. or w.st. 

Theorem 2.2: Let (Tfe) and (S k ) be two sequences of operators. 

i) If T k ^ T and S fc ^ S 1 , then oT fc + /3S fe ^aT + [3S , where a, /3£ R. 
ii) If T fe ^T, then ||T fe || 8 -h \\T\\. 



3. Relations between Modes of Convergence 



It is not difficult to show that 
i) T k ^T^T k s ^T, 

ii) T k x — » Tx on 1" for every i£l=> T-x — ► Tx on Y for every x £ X, 
m)T k x — ► Tx on Y for every x £ X =>• T^x -^ ' Tx on Y for every 

x£X. 

But we note that the converses of (i), (ii) and (iii) are not true. The following 

examples are provided to clarify these. 

Example 3.1. Now, let us consider a sequence (Tk) of operators T k : 
^oo -* c is defined by 

T fe x=( Ki.^.^.-). fce[3*,3"+j>),p=l,2,... 
\ (0,0,0,...), otherwise 

where, x =(£i,£2>£3) •••)*= ^oo- The operator Tj, is linear and bounded for every 
fc £ N. (Tk) is uniformly statistical operator convergent to T — since 

i|{fc<n:||T fe -0||> £ }<^ 
for every p, n £ N such that 3 P < n < 3 P+1 , where 

MT _ nil — / X ' fce[3",3"+p),P=l,2,... 

11 * "" \ 0, otherwise 

for each fc £ N. However, (Tk) is not uniformly operator convergent to T = 
since lim ||Tfc|| does not exist. 

k — >oo 

Example 3.2. A sequence (Tk) of operators T k : £ 2 — > ^ 2 is defined by 

r ((yw),^,^,^,...), fce[3P,3P + p),p = i,2,... 

±k x \ (fc zeros) 

l, (0,0,0,...), otherwise 
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where, a; = (£i,£2>£3; —) e ^ '■ This operator T k is linear and bounded for every 
k G N. We show that (Tfc) is strongly statistical operator convergent to T = 
since 

£|{fc<n:||T fc s-0s||> e }|<*£±£ 
for every p,n GN such that 3 P < n < 3 P+1 and for every x £l 2 . So st-liml^a: = 
on I 2 . However (Tk) is not strongly operator convergent because for x = 
(1,0,0,...) e£ 2 we have 

{V2, m 7^ n and n£l p for some p G N,m G I r for some r G N 
1, nelp and m ^ I p for some p G N 
0, m,n ^ I p for all p G N or m=n 

where I p = [3 p ,3 p +p). 

Example 3.3. In the space £ 2 , we consider a sequence (Tk), where Tk 
: I 2 — > I 2 is defined by 



T v _ > (£i,&,-)> * e [3*, 3? +p),p= 1,2, ... 



(0,0,0,...), otherwise 

where, x = (£i,£2>£3> •■■) € ^ 2 and Tk G B(X, Y). It is easy to see that (Tk) is 
weakly statistical operator convergent to since 



,, £&Ci, k€[3 p ,3 p +p),p=l,2, ... 
0, otherwise 

by Riesz representation, where z — (£J G ^ 2 . However it is easy to see that 
f(T k x) * /(Ox). 

We now proceed to examine some relationships between uniformly sta- 
tistical operator convergence, strongly statistical operator convergence and weakly 
statistical operator convergence. 

Theorem 3.1: Uniform statistical operator convergence implies strong 
statistical operator convergence with the same limit. But the converse of this 
theorem is not true, as the following example shows: 

Example 3.4. In the space i 1 , we consider a sequence (Tk), where 
T k : t 1 -> I 1 is defined by 



Tr= f (£i'&,-). fce[3*,3*+p),p=l,2, ... 

lkX \ (0,0,...,0,2£ fe ,2£ fc+1 ,2£ fc+2 ,...), otherwise 

where, x =(£i,£2>£3> • ••)€ ^ ■ This operator Tk is linear and bounded for every 
k G N. Clearly, for every x G I 1 , 
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±\{k<n: \\T k x-0x || > e }| — ► 0, 
i.e. st-limTfcX = Ox on £ . But (Tk) is not uniformly statistical operator conver- 
gent to T — since 



|Tfc-0|| = 



1, ke[2P,3P+p),p=l,2, 

2, otherwise 



Theorem 3.2. Strong statistical operator convergence implies weak 
statistical operator convergence with the same limit. 

We note that the converse is not true. For this let us consider the following 
example. 

Example 3.5. A sequence (T k )of operators Tk ■ £ 2 — > I 2 is defined by 

f Ki^2.e 3 .-). fce[3",3*+p),p=l,2, ... 

T fe x=:< (0,0, ....O^,^,^, ...), otherwise 

^ (k zeros) 

where x — (£i, £2)^31 •■•) G P ■ This operator Tk is linear and bounded. We 
show that (Tk) is weakly statistical operator convergent to but not strongly 
statistical operator convergent. 

Every bounded linear functional / on £ 2 has a Riesz representation. 
Hence, setting i = k + j and using the definition of T k , we have 



£&Ci, k £ [y, 3? +p), P = 1,2,3, 

f(T k x) = (T k x , z) = _ _ _ 

E ti-k(i = E^Cfe+j, otherwise 
i=fc+i j=i 

where z — (Cj) G^ ■ By the Cauchy-Schwarz inequality 

{00 00 

E&l ElCil , fce[3*,3*+p),p=l,2,3, 
^T 1 ,2 isl 
E Kil E I Cm I , otherwise 
j — 1 m—k-\-l 

st 

Since the set [3 P , 3 P +p) has density zero, it follows that f(T k x)—>0 — f(0x) 
by Theorem 2 in [8]. Consequently, (T k ) is weakly statistical operator convergent 
to 0. However (Tk) is not strongly statistical operator convergent to 0, because 

we have 

* 00 

(El&l ) V2 - fc €[3P,3 p +p),p =1,2,3, .. . 

i7)..c-o.c|| = <; i =1 2 

( E |£i-fc| ) 1/2 . otherwise 

i=k+l 

and so IllfcX — Ox || ^>0. 
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We know that in the finite dimensional normed spaces, the strong and weak 
convergence are equivalent concepts. The below result is a statistical analoque 
of this result. 

Suppose that T k x —* Tx and dim!" = n. Let {ei, e2, •••, e„}be any basis for 
Y and {/i, / 2 , ■•■,/„} be its dual basis and, say 

(k) (k) 

TkX — a\ e\ + ... + (xh e n and Tx — U\e\ + ... + a n e n for every x G X.Then 

fj(Tkx) — or- ,fj(Tx) — otj. Hence and by our assumption fj(Tkx)—>fj(Tx) 

for fixed fj(j — 1, ..., n) implies ck- — > olj for j = 1, ..., n. From this we readily 
obtain 



(fc) 
ct- 

3 



^0 



||T fc x-Tx|| < £ 
for every x G X. This shows that (T^) converges strongly statistically to T. 



4. Statistical Cauchy Sequences of Operators 



Now, we introduce the statistical analog of the Cauchy convergence criterion 
for the sequences of bounded linear operators. 

Definition 4.1. The sequence (Tfe) is a uniformly statistical operator Cauchy 
sequence provided that for every e > 0, there exists such a set E C N that 
5(E) = 1 and 3 fc (e)e E and an N(= N(e)) 3 Vfc > fc , k G E, \\T k - T N \\ < e. 

Definition 4.2. The sequence (Tj.) is a strongly statistical operator Cauchy 
sequence provided that for every e > 0, for every x €X there exists such a set 
E x C N that 8{E X ) = 1 and 3k (= k (e,x))e E x and an N(= N(e,x)) 3 Vfc > 
fc , fc e E x , \\T k x - TatxU < e. 

Definition 4.3. The sequence (Tk) is a weakly statistical operator Cauchy 
sequence provided that V e > 0,Vx eX and V/ e F 1 , there exists such a set 
£x,/ C N that S(E x j) = land 3 fc € S^,/ and an JV(= N(e)) 3 Vfc > fc and 
fc € S Kj/ , |/(T fc x) -'/(Tjvx)! < e. 

Theorem 4.1. Let Y" be a Banach space. Then (Tk) is strongly statistical 
operator convergent on X if and only if (T k ) is a strongly statistical Cauchy 
sequence on X. 

Proof: Assume that st-limTfeX = Tx for every a;G X.Then for every s >0 
and for every xG X there exists such a set E C N that 6(E X ) — 1 and 3 fco G 
E x 3\/k> fc , fc G £ x , ||T fe o; - Ta;|| < e/2. If TV is chosen so that ||TAra; - Ta;|| < 
e/2 for every xG X then we have ||2fcX — Tjvx||<e for 3fco G E x , Vfc >fco, fcG E x 
and for every xG X. 

Now suppose that (T k ) is strongly statistical Cauchy sequence. The we may 
choose n(l) such that the set A\ = {fc : ||TfeX — T n (i)x|| < l} has asymptotic 
density 1 for every iel. Suppose that n(l) < n(2) < ... < n(p) have been se- 
lected, iil<r<s<p and A s = {fc : ||XfcX — T„( s )x|| < 2 1_s }, then the set A r 
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has asymptotic density 1 for every x <E X and n(s) € A r . Choose N such that 
the set {k : \\T k x — Tnx\\ < 2~( p+1 )} has asymptotic density 1 for every x <G X. 

N 

Since the set ( f] A,) n {k : \\T k x - T N x\\ < 2~( p+1 )} has asymptotic density 

N 

1, there exists an n(p+ 1) € ( f| A,-) n {fc : ||T fe x - T N x\\ < 2"( p+1 )} such that 

n(p) < n(p+l) and A p+1 = {£: : ||T fc a: - T n(p+1) x|| < 2~p} D {k : \\T k x - T N x\\ < 2-^+^} 
for every x € X. Observe that A p+ i has asymptotic density 1 and n(p+l) G A s 
for all s < p + 1 and for every id 

Note thet since ||T„( p )a; — T rl ( p+1 )a;|| < 2~ p for every x G X, (T„( p )) is 
Cauchy sequence on Y. Since Y is Banach space, there exists a Tx <G Y such 
that lim T n t p \x — Tx. We claim that (T k ) is strongly statistical convergent 

p — >oo 

to Tx for every x € Y.Let e > be given and select p € N such that 
||T„(j,)X - Tx\\ < | for every x e X and e > 2~ p . Note that if \\T k x - Tx\\ > e 
then ||T n ( p )a; — T k x\\ > f > 2 1_p , and hencek is not element of A p . It follows 
that {k : \\T k x — Tx\\ > e}has asymptotic density zero for every x e X. Hence 
(Tfc) is strongly statistical operator convergent on X. 

Theorem 4.2. Let Y be a Banach space. Then (T k ) is uniformly statistical 
operator convergent on X if and only if (T k ) is a uniformly statistical Cauchy 
sequence on X. 

Proof: It can be shown in a similar way of Theorem 4.1. Therefore, we omit 
it. 

The next theorems are statistical analogues of a well-known theorems 

Theorem 4.3. Let every statistical Cauchy sequence in Y be statistical 
convergent sequence. Then every statistical Cauchy sequence in B{X, Y) is 
statistical convergent. 

Proof: We consider an arbitrary statistical Cauchy sequence (T k ) in B(X, Y) 
and show that (T^) statistical converges to an operatorT e B(X, Y). Since (T k ) 
is statistical Cauchy, for every e > and there exists such a set E C N that 
6(E) = 1 and 3k e E, 3N e N VA: > k , k e E , \\T k - T N \\ < e. For all x€ X 
and Vfc > fcoj k, fco € T, we thus obtain 

\\T k x-T N x\\ <e\\x\\ (1) 

Now for any fixed x and given ei,we may choose e = e x so that e x \\x\\ < e. 
From (1), we see that (T k x) is a statistical Cauchy sequence in Y . Then (Tfex) is 

st 

statistical convergent, say, T k x — > j/. Clearly, the statistical limit i/eY depends 

on the choice of x€ X. This defines an operator T : X — > Y, where y = Tx. 

Since 

T(ax + /3y) = st-limTfe (ax + /3y) = ast-limTfeX + /3st — UmT k y — aTx + j3Ty, 

the operator T k is linear. Using the continuity of the norm and Lemma 5 [7], 

we obtain from (1) for every e >0 and Vfc >fco, kGE, an TV £ N and all xGX, 
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= 


\\T N x- {T N x-Tx)\\ 


< 


||T/vx|| + \\Tnx — st-limTfcxH 


< 


c ||x|| + st — lim ||XjvX — Tkx\\ < (c + s 



\Tx\\ 



since Tn is bounded. This shows that T is a bounded linear operator. Further- 
more, we obtain \\Tk — T\\ = sup \\TkX — Tx\\ — > from TkX — > Tx for every 

||x|| = l 

xe x. 

Definition 4.4. The sequence (Tk) is said to be statistical bounded if 
there exists such a set E c N that 5(E) ~1 and exists a M x > 0, such that 
||T fc x||< M x for all k g £ and every i e I. If A'4 = M, then (T fe ) is said to be 
uniformly statistical bounded. 

Lemma 4.1: Let (T k ) be a sequence of operators T k GB(X, Y) and B r (x ) C 
X be a closed ball such that (||Tfcx||) is statistical uniform bounded for \/x€B = 
B r (x ), say, 

sup ||T fe a:|| = c, 

keE 

where c is a real number and S(E)=1. Then the sequence (||Tfc||) is statistical 
bounded. 

< r, so that 



Proof:Lct x g X be arbitrary, not zero. Then 



x o + irar - ^o 



Ikll 



.To + tSt g B . This yields for all k GE, where the set E C N and 5(_E) = 
1, ||T fc a:|| < f \\x\\ .Hence for all k eE, \\T k \\ = sup ||T fc x|| < 2 f. 

\\x\\=l 

Theorem 4.4. Let (Tfe) € _B(X, Y"), where X is a Banach space and Y a 
normed space. If there exists a set BcNof asymptotic density 1 such that for 
V.xe X and for Vfc g E, 

\\Tkx\\ < c x 
where c x is a real number, then there is a c such that 

l|7fc|| <c 
for all keE. 

Proof: We assume that the sequence of the norms ||T fe || is not bounded 
on the set E. Then the sequence (||Tfea;||) is not also bounded on the set E 
and on all closed ball in X since Lemma 4.1. Then there exists 3xi&Bq and 
3niGE such that ||T ni xi|| > 1. Since T ni is continuous and so is the norm, we 
have ||T ni x|| > 1 on B\ = B ri (x\) c B - Hence there exists 3 X2&B1 and 
n 2 > ni,3n 2 G E, ||T„ 2 x 2 || > 2. Continuing in this way we obtain a sequence 
(xk) and (Bk) of closed balls such that x k € Bk and B D B\ D ... D B k D ■■■ 
and ||T nfc cc|| > k for nk g E and on the Bk- This yields for m < n 2 < ... 
and xG X, \\T nk x\\ > k, where nk € E for k — 1,2, .... Hence we see that the 
sequence (T n x) is not statistically bounded. 
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Theorem 4.5. Let T k EB(X,Y), where X is a Banach space and Y a 
normed space. If (Tk) is strongly statistical operator convergent with limit T, 
then T eB{X,Y). 

Proof: From Theorem 4.4, the proof is trivial. 



5. Statistical core of sequences of operators 



Let (Tk) be a sequence of operators. For any sequence T = (||Tfcx||),the 
statistical limit superior of T and statistical limit inferior of T are 

. , • rr f sup E Tx , if E Tx ^ 

st-limsupT x = { _ Qo iifETx = 

. ,. • t rr J mfF T;CC ,if F T<X ^ 

st — Limin 1 1 x — S ■ t n ™ 

3 \ +oo ,if F T , X = 

where E T , X = {a x € R/S({k : \\T k x\\ > a x }) ^ 0} and F T , X = {b x e R/<5({fc : 

||T fc a;|| < b x }) + 0} for every xEX. 

For example, consider the sequence (Tk) of operators Tk : I 1 —> I 1 defined 

by 

( (fc,0,...,0,...), fc = n 2 n = l,2,... 

T fc x=<{ (2^ 1 ,...,2^„,...), k ^ n 2 and k = In - 1, n = 1, 2, ... 
[ (3£i,.-.,3£„,...), fc 7^ n 2 andk = 2n, , n=l,2,... 

where a; = (£ 1; ...,£„, ...) € ■£ . The operator T& is linear and bounded for every 
fixed k £ N. It is easy to see that Et, x — (-co, 3 ||x||) and Fx yX — (2 ||x|| , +oo) 
since 

{k, k = n 2 n = 1,2, ... 

2||x||, fc^n 2 simI fc = 2n-l, n= 1,2,.. . 
3||x||, k y^ n 2 and k — In, , n = l,2, ... 

Clearly, ||Tfcx|| is unbounded but it is statistical bounded. For this sequence, 
st — HmsupT x = 3 || x || and st — limin f T x = 2 ||x|| . Furthermore, we can easly 

obtain from [4] that for any bounded sequence T = (||Tfcx||) (i.e. sup ||Tfcx||<co 

feeN 
for every id) 

st — limsupT x =(3 x ^=^£oi any e > and for every x £ X, 6({k : ||Tfex|| > 
P x ~ £ }) 7^ an d $({k '■ \\Tkx\\ > j3 x + e}) = 0;and st — limin fT x =a x ^^for 
any e > and for every x G X, 5({k : ||Tfex|| < a x + e}) ^ and 5({k : 
||T fe x||<a x -e})=0. 

We can define the statistical core of bounded linear operators as follows: 

Definition 5.1. For any statistical bounded operator sequence T = (||Xfcx||), 
the statistical core of T is the closed interval [st — limin fT x , st — UmsupT x ]. 
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If (Tfe) is not statistical bounded, the statistical core of T is defined by 
either (—00, st — UmsupT x ], [st — UminfT x , 00) or (—00,00). 

Let X and Y be two nonempty subset of the spaces of complex sequences. 

Let A = {a n k)[n,k = 1,2,...) be an infinite matrix. We write Ax — (A n (x)) 

00 
if A n (x) = J^ cinkXk converges for each n G N. Thus, we say that the matrix 

fe=i 
A defines a matrix transformation from X into Y . A is called regular if xG c 

implies Ax <E c and preserves the limit, where c is convergent sequences space. 

In [4] Fridy and Orhan prove necessary and sufficient conditions for which 

the inequalities 

limsup Ax < st — limsup x 

and 

st — liminf x < liminf Ax 

for every x€£oo. 

Now, Similarly, we will give these results for sequences of bounded linear 
operators. 

Let A = (a„fc) be an infinite summability matrix. For a given sequence of 

bounded linear operators T — (T„), the sums 

00 

Al n X = 2_i a nk-LkX 
fc=l 

are called the ^-transform of the T provided the series converges for each k e N 
and for all xe X and denoted by ^r = (AT n ). 

Lemma 5.1: Let(T„) be a sequence of uniformly bounded operators ( i.e. 

there is a positive number M such that HT^rrH < M for all seX and for all 

00 
n £ N ) T„ 6 B(X, Y). Suppose the matrix A satisfies sup n "^2 \&nk\ < 00 then 

fe=i 

limsup(sxq> ||AT„a;||) < st — Hrnsup( sup ||T„a;||) (2) 

x£X x£X 

if and only if 

i) A is regular matrix and lim ^2 Wnk\ — for E E N such that 5(E) = 0; 

" keE 

ii) lim J2 \ a nk\ = 1- 

n fe=i 
Proof: (Necessity) Assume that for any uniformly bounded sequence 

(||T„x||) on7, the matrix A = (a n k) satisfies condition (2). Let a n = sup HTn^H -Then 

xex 

00 

y = [o-n) is a positive real number sequence and y G £00 • Since sup n "^2 \o-nk\ < 

k=l 

00, we obtain that 



sup(sup ||AT„x||) < sup(supV] \a nk \ \\T k x\\) 
ngN xex 7 

< supa fe (sup^ |a nfe |) < 00, 



nGN xGX riGN x£X , 

k 



fcGN n£N , 
k 
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i.e., (sup |jAT„x|j) <E i^. Thus, using Lemma in [4], conditions (i) and (ii) can 

xeX 

be easly proved. 

(Sufficiency) Assume now that (i) and (ii) holds. Let (||T n x||) be any 
uniformly bounded sequence on Y. Then there is a positive number K such 

oo 

that sup ||T n x|| = K for all xG X. Since sup n "^2 \o>nk\ < °°> we have 

n£N fe=l 



/ , d nk T k Q 



< ^2 \ank\\\T k x\\ < K^2 |a„ fe |<oo. 



Hence (AT n x) € 4k,. Now, let (3 X = st — limsup(sup ||T„x||).Then we have E — 

xex 
{k : sup ||Tfcx|| > f3 x + e} for a given s >0 and 5(E) = 0. Hence it is clear that 

xex 
sup ||Tfex||< f3 x + e for k £ E. Now we can write 
xex 



2 , a n k T k S 



^2(\a n k\ \\T k x\\ +a nk T k x)2 1 - ^(|a„ fc | \\T k x\\ - a nk T k x)2 1 



< 



2 y a nfeT; 



A'X 



+ ^( |a nfc | - a„ fe ) ||T fc x|| 



< K ^2 \ a nk\ + X! I a " fc l (sup ll^fcxll) + K^2( \a nk \ - a wfc ) 

feS-E k (£E xeX k 

< K ^2 \ a nk\ + Wx + e )5Z \ a nk\+ K^(\a nk \- a nk ). 

keE kf^E k 



Then we conclude that 
^ a nk T k x 



sup 



< -K" ^ |a TO fc| + (P x H-g)^ |a„ fe | + if^( |a„fc| - a nk ). 

k£E k 



keE 



Using the (i) and (ii), we have 

limsup (sup ||ylT„x||) < j3 x + e. 
xex 
Since e is arbitrary, this completes the proof. It is clear that one can prove a 
similar way in the following lemma. 

Lemma 5.2. Let (T k ) be a sequence of uniformly bounded operators T k <G 
B(X,Y). 

oo 

Suppose the matrix A satisfies sup„ J^ |a n fe| < oo then 

fe=i 

st — liminf(ini \\T k x\\) < liminf(m£ \\AT k x\\) 
xex xex 

if and only if 

i) A is regular matrix and lim ^2 \a>nk\ — for E C N such that 8(E) = 0; 

n keE 

oo 

ii) lim J2 \ a nk\ =1- 
n fe=i 
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From Lemma 5.1 and 5.2, we give the following theorem. 

Theorem 5.1: Let (Tfe) be a sequence of uniformly bounded operators 

oo 

Tfc G B{X, Y). Suppose the matrix A satisfies sup n ^2 \dnk\ < °° then 

fc=i 
limsup(sup \\AT n x\\) < st — limsup(swp ||T n x||) 
xex xex 

and 

st -liminf '( inf ||T fe x||) < liminf {mi \\AT k x\\f 

xeX x£X 

if and only if conditions (i) and (ii) in Lemma 5.2 are satisfied. 
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Abstract 

The aim of this paper is to find, in a closed form, special solutions 
of equations describing a one-dimensional non-stationary flow of a liquid 
containing dissolved gas. The special feature of these solutions is that de- 
spite the fact that they do not satisfy all initial and boundary conditions, 
they describe a physical characteristic qualitatively analogous to that de- 
scribed by the original equations. Thus these special solutions may prove 
useful means for judging the reliability of the original mathematical model 
of the problem. 
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1 Introduction 

In the paper we analyze the mathematical model of the flow of a column of a 
real liquid. 

It is known that cavitations play an important role not only in the theory 
of fluids but may be even more significant in the engineering and technological 
practice. Let us mention the cavities and bubbles which appear at suction 
compartments of pumps, in turbines, or in hydraulic machinery. Monitoring of 
possible separation of the gas from liquids is important since if it appears, then 
there is a danger of failure when enormous forces are loaded to devices which 
serve, for example, in the building industry. Therefore, a dynamic model of a 
two-phase flow has been derived under physically realistic assumptions. This 
has been done in the former Institute for Construction of Machines in Bcchovice, 
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Czech Republic (see [1]). To preserve the author's and the institute's rights, we 
do not present here the derivation of the model and refer the interested readers 
to Professor Jan Sklfba at the Technical University of Liberec, Czech Republic 
(jan.skliba@tul.cz). The purpose of this paper is to partially mathematically 
analyze the model to be defined below. Our analysis is based on a special 
a priori assumed form of solutions to describe certain features of the flow in 
question. We balance this limitation by providing solutions in a closed form. In 
spite of this restriction we believe that our approach is useful in the engineering 
practice, based on the authors' experience in application of similar mathematical 
analysis to other problems. 

2 The formulation of the problem 

The equations of the flow of a real liquid of the length I is possible to write in 
the form (see for instance [1]): 

wt + Po 1 Px + f(w)=0 7 (2.1 

p t + poC 2 (p,^)w x = 0, (2.2 

Jt + w lx = g(j,p), see (<M), te(0,T), (T>0), (2.3 

w(x,0) = wo(x), (2.4 

p(x,0) = po(x), (2.5 

7(3,0) =7o(a0, xe[0,I], (2.6 

C(p(0, t), 7 (0, t)) Pt (0, t) + Q v (p(0, t),H(t)) - S Q w(0, t) + <pH{t) = 0, (2.7 

w{l,t) = h{t), (2.8 

H{t) + $(t,H{t),H{t),p{0,t), Pt {0,t))=0, ie[0,T], (2.9 

H(0) = H , H(0) = H X . (2.10 

The quantities occurring in (2.1)-(2.10) have the following meaning: 

w = w(x, t) the velocity of the liquid in the point x and in the time t, 

p(x, t) the pressure, 

7 = 7(x, i) the mass of the freed air in the unit volume of the liquid, 

Pq the density of the liquid, 

c = c(p, 7) the sound velocity in the liquid and in the liquid 

containing the air, respectively (given function of p, 7), 
/ = f(w) the coefficient of the resistance (the friction of the liquid 

on the wall of the duct), an odd function, 

= {K u ((7-7)/K H -p), if {^-i)/K H >p, 
\K r ({7-i)/K H -p) if (j--f)/K H <p, 
K u , K r the constants characterizing the proportionality of the 

velocity of loosening, and dissolution on the pressure 

gradient, respectively, 
Kh the coefficient of absorption, 



g = g{i,p) 
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C* = 


= C-(p, 7 ) 


H 




Qv 


= Qv(p,H) 


So 




<P 
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the total mass of the air in the unit volume, 
the initial distribution of the velocity, the mass, and the 
pressure of the loosened air in unit volume, respectively, 
the hydraulic capacity (the given function of p, 7) , 
the throw of the valve, 

the flow through the valve (the given function of p, H) , 
the cross-section of the duct, 
the acting facing of the valve, 

the flow rate caused by the hydrogenerator at the end 
of the duct, 
Ho, Hi the initial position, and the velocity of the valve, 

respectively. 

In what follows, we assume that all given functions are sufficiently smooth, 
and the solution will be sought smooth as well, i.e., continuously differcntiable. 

The special solutions of our problem will be the functions w, p, 7 satisfying 
equations (2.1), (2.2) and (2.3). The special feature of these solutions is that 
despite the fact that they do not satisfy all initial and boundary conditions, 
they express physical characteristics qualitatively analogous to those described 
by (2.1)-(2.10). 

3 Stationary solution 

Three functions w — w(x), p — p(x), 7 = 7(2;) depending only on the length 
coordinate x of the tube, and satisfying equations (2.1) to (2.3) are understood 
as stationary solution. These equations written for functions independent of the 
variable t form a simple system of three ordinary differential equations ([2]) 

PoV + /(«0=O, (3.1) 

p c 2 (p,j)w' = 0, (3.2) 

wi = g{^,p), a; €(0,0, (3.3) 

where p' = dp/dx etc. Analogously as in [1] the function c{p, 7) will be assumed 
in the form 

c(p,7)= 2 °f, , (3-4) 

C2P 2 + 7 + c 3 

where Cj > 0, i = 1, 2, 3 are constants. The physical principles suggest the 
condition c(p, 7) > 0. Since the trivial solution with p = is not interesting, the 
equation (3.2) gives us w' — and from here we have 

w = w = constant. (3-5) 

Consequently, (3.1) implies 

p(x) =p - pof(w )x, (3.6) 
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and for the function 7 we obtain the equation 

7' = — g(l,Po - Pof(w )x). (3.7) 

w 

The constants wq, Po niay be chosen arbitrarily. Also the integration of (3.7) 
gives an additional free integration constant. The heuristic considerations indi- 
cate that the stationary solution should be a limit of a non stationary solution 
for t — > 00. In order to respect the boundary conditions partially at least we 
will require the stationary solution to satisfy the generalized limit of boundary 
conditions. More exactly, we impose the condition (compare with the (2.8)) 

1 f* 

wo = lim - / h(s)ds. (3-8) 



4^00 t JQ 

Naturally, we must suppose that the function h is such that this limit exists. 
The limit is requested in that sense since the delivery of the hydrogenerator may 
regularly oscillate about a certain value so that the limit lim^oo h(t) need not 
exist. The number po in (3.6) is then determined from (2.7), i.e., from 

Qv(po,H )-S w = (3.9) 

supposing that equation (3.9) is solvable with respect to pq. It remains to 
determine the function 7 from (3.7). It is obvious that the sign of u>o indicates 
the direction of the flow. If the hydrogenerator is supposed to be at the point 
x = lit is logical that w < and then it is necessary to describe j(l) = 70 - the 
mass of loosening air in the unit volume of incoming liquid. Since the function 
g in the right-hand term of (3.7) is given by different formulas for loosening and 
dissolution of the air we must distinguish between the following two cases: 

' v (l~l \ - t 7-7^ 



x K H J K H 

9(1,P) ={ ;_ ( _ (3.10) 

K r -F? P ' lf -F? < P- 



Let us define the functions 

<p(x) =j- K H \p - p f(w )x], (3.11) 



and 



^ = -K u if e > 
m) = . w K H (3 i2) 



K r 



= -K 2 , if $ < 0, Ki > 0, i = 1, 2. 



w K H 
Then it is possible to rewrite equation (3.7) in the form 



Y = (¥>(*) -7) ■*■(¥>(*) -7), (3-13) 
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or, if we put 

y(x) = ip(x) -j(x), (3.14) 

j/ + yK(y) = cp'(x). (3.15) 

According to (3.11) it is 

ip'(x) = K H p f(w Q ) = ipo = const. (3.16) 

If (7 — Jo)/Kh > p(l) — Po ~ Pof(wo)l, then y(V) > 0, and the continuity of 
the solution y of (3.15) implies -K"(y) = K u /wqKh = —K\ = const, for x < I 
sufficiently close to k /. Consequently, equation (3.15) is linear on this interval 
and we have 

y(x) = exp(lfi(a; - l))y(l) + / exp(ifi(a; - £))^ d£ 

= exp^z 0)1/(0 + CXP( ^~ 0) ~W (3.17) 

Since w < and the function /(u>) describing the friction is necessarily odd 
and positive for positive w's, from (3.16) and (3.17), it follows that y(x) > 
exp(Ki(x — l))y(l) > in the whole interval [0, 1]. But this fact means that the 
function y(x) is defined for all x e [0,1] by formula (3.17). Using (3.17), (3.14) 
and (3.11) we then obtain for the function j(x) the formula 

l( x ) = <p(x) - v( x ) = 7 - K H [p - pof(w )x] 

- exp(-lfi(/ - x)) (7 - K H [p - pof(w )l] + 7o) 

1 - cxp(- Kiil-x)) 
^ K H Poj{wo) (3-18) 

for x <G [0,1], and K\ = —K u /wqKh- 

On the other hand, let (7 — Jo)/Kh < p(l) = Po — pof( w o)l, i- e -, y{l) < 0. 
Then it follows - again from the continuity - that K(y) = —K 2 in (3.15) for 
x G {x* ,1] where x* is the upper bound of numbers in the interval (—00,/) 
satisfying y(x*) = 0. On the interval (x* ,1], the solution y(x) is given by 
formula 

y(x) = exp(K 2 (x - l))y(l) + ^M^x- I)) - I ^ 

If x* < then the solution of (3.15) is given by (3.19) in the whole interval [0, 1]. 
From the condition y(x*) = and from (3.19) we obtain the unique point 

x* = l+-^ln\ ^-777). (3.20) 

K 2 y<p + K 2 y{l)J 

If x* > which means the assumption 

70 <7-^ffbo-po/(w )^H — (exp(- - — ) -1J, (3.21) 
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as we obtain after elementary calculation and substitution from (3.16), (3.14) 
and (3.12), then y(x) is given by (3.19) only for x G (x*, I]. In the interval [0, a;*] 
we then easily obtain that 



y(x) = cxpl^K^x - x*))y(x*) + / exp(ATi(x - £))<A)d£ 

J X* 

K%p w f(w ) /, / K u 



K„ 



i l -^^t H ^-^)\ ^ 



(y(x*) = !), and for x* it is necessary to substitute from (3.20). If we substitute 
from (3.14), (3.11) in the formulae (3.19) and (3.12), respectively we obtain the 
formula for 7(2) analogously as in (3.18). 

The stationary problem is completely solved. 

4 Oscillatory solution 

Oscillatory solutions are called such solutions of equations (2.1) to (2.3) which 
do not depend on the space variable x. Thus, w = w(t), p = p(t), 7 = "f(t). In 
the special case, the system (2.1) to (2.3) may be written as follows: 

w + f(w) = 0, (4.1) 

p = (4.2) 

7 = g(-y,p), t>0. (4.3) 

We see that the equations are practically separated since if we compute 

p(t) = po = const., (4-4) 

from (4.2) we have two separate equations, (4.1) and 

7 = . .9(7, Po)- (4.5) 

The solvability of equation (4.1) for t G (0, 00) is guaranteed by the assumption 

m = inf f'(w) > —00. (4-6) 

(f'(w) is continuous). The proof of this assertion is an elementary consequence 
of the theory of ordinary differential equations. Assumption (4.6) is practi- 
cally always fulfilled. If, for example, f(w) = k\w\w (k > constant), then 
f'(w) — 2k\w\ > = to. As far as equation (4.5) is concerned the global ex- 
istence of the solution is guaranteed by the inequality |.g(7i,Po) ~ 9(12, Po)\ < 
max{K u /Kjf,K r /KH}\ji — 72I, 71,72 <= K which follows from (3.10). This 
condition means nothing else than that the function g is globally Lipschitzian 
with respect to the variable 7. Naturally, it is possible, for w and 7, to prescribe 
the initial conditions 

w(0) = wo, 7(0) - 7o- (4.7) 
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This is a possibility how to satisfy, at least partially the initial conditions (2.4) to 
(2.6), naturally only with constant functions wq(x) — Wo, Po{%) — po, 7o(#) — 
7o- On the other hand, boundary conditions (2.7), (2.8) will be never satisfied 
by this type of solution. 

Equations (4.1), (4.5), (4.7) can be solved numerically by well known ap- 
proximate methods for solving ordinary differential equations. The solution in 
a closed form can be obtained only for a particular choice of the function /. 
Hence, let us investigate the case 

f(w) = k\w\w. (4.8) 

If wo > then we have, due to (4.1), (4.8), f™ dw/kw 2 = —t and from here 

w (t) = —^ — . (4.9) 

w 1 + kw t y ' 

If wo < we obtain analogously 

wit) = —^ . (4.10) 

w 1 - kw t y ' 

The formulae (4.9), (4.10) may be, for both cases, joined in one 

w(t) = - — ^—r- (4.11) 

w l + k\w \t y ' 

Equation (4.5) may be now rewritten in the form 

y + yK(y) = 0, (4.12) 

where y(t) = 7 - p K H - 7(4), 

Kiu=i K y (4.18) 




If j/o = 7 — PoKh —70 > 0, the solution of (4.12) is the function y(t) — 
exp(-(K u /K H )t)y Q . If y < then y(t) - cxp(-(K r /K H )t)y . Using (4.13) 
we have from here 

7(t) = j-poK H -cxp(-— —t), if 7o<7-Po-?Oj ( 4 -14) 

and 

/ K r \ 
7(*) =1-PqK h -cxp(^-— -tj, if -y > j~p K H . (4.15) 

The oscillatory solution is given by formulae (4.4), (4.11), (4.14) and (4.15). 

Here the oscillatory solution do not respect fully its appellation. Namely, the 
functions (4.14), (4.15) stabilize for t — > 00 to the steady state value 7 — PqKh 
without any overswing. 
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5 Combined solutions 



By combined solution we mean in this context such solution of equations (2.1) 
to (2.3) which is neither stationary - nor oscillatory and for which at least one 
of the functions w, p, 7 depends only on x or t. 

Consider first such solutions, for which w = w(t), p — p(x). For this case 
the equations (2.1), (2.2), (2.3) have the form 

w+—p'+f(w) = 0, (5.1) 

Pa 

lt+wix= 9(l, P)- (5-2) 

Equation (2.2) is obviously satisfied identically From (5.1) it follows 

p' (x) = const. (5-3) 

Instead of (2.7), (2.8), choose for p the boundary conditions 

p(0)=Po, p(l)=Pi, (5.4) 

which imitate the gradient of the pressure caused by the hydrogenerator. Then 

p(x)=p Q -\ x, (5.5) 

as it follows from (5.3). Consequently (5.1) gives 

■ _,*/-> Pa- Pi / Kfl x 

w + j(w)= — (5.6) 

Pol 
for the function w. If we suppose (4.6) and supply the initial condition 

w(0) = w (5.7) 

we know that there exists a global solution w(t) of problem (5.6), (5.7). If we 
know such solution we can find the function 7 from the equation 

-ft + w(t)j x = ,g(7,_po H j a;) (5.8) 

by the method of characteristics. Naturally it is necessary to add the corre- 
sponding initial and boundary conditions. The initial condition may be given 
in the quite general form 

7(3,0) =7o(*). (5.9) 

If it is w(t) > 0, we must prescribe 

7(0,t)=7°(*), (5-10) 
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i.e., the concentration of the air in the liquid flowing into the tube from the end 
x = 0. If w(t) < it is necessary to prescribe 

7(M) =7 1 (*), (5.H) 

for the liquid flowing into the tube from the end x — I. Boundary conditions 
(5.10), (5.11) may change one for other during the time but never can be pre- 
scribed both together since w is independent of x. Hence, the problem given 
by (5.8), (5.9), and (5.10) and (5.11), respectively must be solved after such 
time-intervals, on which the function w(t) does not change the sign. In order to 
avoid the complicated testing let us choose a particular situation which is also 
applicable in practice. 
Suppose that 

Pi > Po, w < 0, — — — - f(w ) < 0, 
Pol 

f(-0 = -f(0, /'(£) > 0, ?eR. (5.12) 

Physically, these assumptions mean 

(i) the pressure on the left-hand side of the tube is greater than on the right- 
hand side; 

(ii) the liquid flows from the right-hand side to the left-hand side at the be- 
ginning; 

(iii) the difference of the pressures is not yet weighted by the force of the 
resistance of the duct at the beginning; 

(iv) the force of the resistance always acts against the direction of the motion 
of the fluid and it does not abate with increasing velocity. 

Assumptions (5.12) and equations (5.6), (5.7) imply w < 0. Thus w(t) is a 
decreasing function for increasing t either for any t > or there exists a t* > 
such that f(w(t*)) = (po — pij/pol and then w{t) = w(t*) for t > t* . The 
reason is that the function w(t) = w(t*) is a solution of equation (5.6) on the 
interval (£*, oo) with the initial condition w(t*) and that the equation (5.6) has 
the unique solution for the given initial condition. Hence w(t) < for all t > 
and we have to solve the problem (5.8), (5.9), (5.11) for determining 7. 

Let us apply the method of characteristics. Let x € [0, 1], t > be arbitrary. 
Put 

X(T-,x,t) = x+ w(s)ds. (5.13) 

Then X T {r;x,t) = w(t). If wc put, moreover, 

«Kt)=7(*(t;M),t). (5.14) 
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we have 

<A(T) = "f X X T + J t = 7( + WTj; 

= ff (0( T ),p o + ^^Af(T ;a; ,i)). (5.15) 

Thus if x and £ are given we obtained a differential equation for the function 
4>(t) given by (5.14). If we find <jf>(r) for r € [0,£], then 

7(«,*) = 0(t) (5-16) 

as it follows from (5.14), (5.13). The initial condition for the function </>(r) is 
given either by the formula 

0(O)=7(#(O;s,t),O)=7o(*- / w(s)ds) (5.17) 

Jo 

if the characteristic £ = A^r; x, i) falls on the axis r = in the interval < £ < Z 
or by the formula 

0(r*)= 7 (i,T*)=7V) (5.18) 

where X{r*; x, i) = I if this characteristic falls on the axis £ = Z at some point 
r = r* > 0. The value of t* is to be computed from an implicit equation 

t 
w(s)ds = X(r*;x,t). (5.19) 

J T* 

Put 

l/(T) = V (r;x,t)-^(T), (5.20) 

where 

v(T;x,t)=^-K H (p + ^^(x+ f w(s)ds)) (5.21) 

and 

^-, if £>0 

fc-(0 = < / (5.22) 

Then (5.15) can be written in the form y + yK(y) = ip T and it is, according to 
(5.21) and (5.12), ip T = Ku((po — Pi)/l) w ( T ) > 0- Solve the equation 

y + yK(y) = K H ^^w(r) (5.23) 
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with the initial condition 

y(O)=¥>(O;s,t)-0(O) 

= 7 - K H ( po + — — : — -(x+ w(s)ds)) -7o(»- / w(s)ds) = y 

(5.24) 



which corresponds to the situation when the characteristic £ = A"(r;x,i) falls 
on the axis £. If it is now yo > 0, then the solution of problem (5.23), (5.24) is 
the function 

y(r) = expy-jf L T)y + K H — — -/ exp(--^(r - s)J w(s)ds (5.25) 

for t e [0,i\. If yo < 0, then the solution of problem (5.23), (5.24) is the function 

y(T) = cxp[-^T)y + Kh-^ — -/ cxp^--^(T-s)Jw(s)ds (5.26) 

as far as y(r) < 0. If t\ = inf{r; < r < t,y(r) = 0} < t, then it is necessary 
to prolong the solution (5.26) onto the interval [ti,£] by the formula 

y(r)=K H ^^J\^-^(T-s))w(s)ds. (5.27) 

When the characteristic £ = X(r;x,t) falls on the axis £ = I we proceed 
completely analogously; as an initial condition we use 

y(T*) = <p(T*;x,t)-<f>(r*) 

= l-K H (p + P -±^(x+ r w(s)ds))- 1 \r*)=y 1 (5.28) 

and integrate the equation over the interval (t*,£]. At the same time the value 
t* = t*(x, t) will be computed from (5.19). Again, it is necessary to distinguish 
whether yi > or j/i < 0. In the first case we obtain 

y(r) = cxp ( -^{t - t*) J y\ 

+ KH Po_PiJ\ xp f^ {T _ s) \ w{s)d ^ re[r * }t ]. (5 .29) 

In the second case it is necessary, moreover, to distinguish whether y(r\) = 
for some (smallest one) r x <G (r*,i) or y(r) < for all r e (t*,£). In the latter 
case we have 

j/(r) = cxp(- ^(t-t*))^ + k h—j — - / cxp(--^(r-s)Ju;(s)ds 

(5.30) 
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for all t G [t*,£]. In the opposite case the formula (5.30) is valid only for 
r G [t*,ti] and for r G (ri,t), we must, moreover, y(r) prolong using the 
formula 

Po - Pi f T ( K, 



y (r) = K H ^j^j exp^- j^(t - a))w(a)da, r G (n,t). (5.31) 

Finally, the value of 7(2;, t) is found from formula (5.16), where <f>(t) — <p(t; x, t) — 
y(t), if is given by (5.21), y(t) by formulae (5.25) to (5.28), the values yo and 
j/i by (5.24) and (5.28), respectively, and r* by equation (5.19). 
Hence, if 



70 (a- / w(s)ds) <j-K H (p + ^j^(x+ I u»(*)da)), (5.32) 
then, according to (5.26), (5.20) and (5.25), we have 



7(x, t) = 0(t) = ip(t; x, t) - y(t) 
Pi -Po 



= 7 - X ff (p 

-expf- — — tj f 7- K H (p + — — -\x + w(s)dsj 

-7o fa;-/ w(s)dsj J - K H ^— — - / expf- — — (i - s)Jw(s)ds. 

(5.33) 

In other cases we proceed analogously. We will not introduce the resulting 
formulae since the procedure consists in fact only in the routine substitution even 
though formally complicated. It is clear that we may not be able to compute 
w(t) in a closed form for general function /. Consequently also the formulae for 
7(2:, t) will not be explicit in general. 

On the other hand let us introduce the concrete formulae for the physically 
interesting special case, namely, that f(w) — k\w\w. Then equation (5.6) has 
the form 

w = ^L_Pi + kw 2 (5.34) 

Pol 

since w < as it follows from our preceeding investigations and thus fc|w|u; = 
—kw 2 . Moreover it is p\ > po- Put 



*(*) = (J^LY"*,®, t > 0, z(0) = z = (J^-V'V (5.35) 



Then (5.34) can be written in the form 



sK^M^r- < 5 -> 
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Integrating (5.36) over the interval (0, t) we get 

■z-lzo + 1 

. Z + 1 Zq — 1 

where 

a = 2 ( *(Pi-P°) ) 1/2 . (5.38) 

Applying to (5.37) the function exp we obtain 

e Qt . (5.39) 



1 z - 1 „ at 



z + 1 z + 1 
After straightforward but rather lengthy calculation we arive at the formula 

M fPi-Po\ 1/2 ,.x {p 1 -p Q \ 1 / 2 aw + 2+(aw -2)e at 
^H^r) ^UpoT) m, + 2+(^ -2)e^ (5 ' 40) 

where a is given by (5.38) and k is the constant of friction in (4.8). 
Notice, that 

lim w(t) = - ( — — — ) 

which corresponds to the equilibrium state. Since the limit speed of the fluid 
is negative, the fluid flows from the end x = I in direction to x = 0. This in 
accordance with the physical idea that the fluid flows from the place of higher 
pressure to the region, where the pressure is lower. 

All three special solutions which we have introduced have a common defi- 
ciency They are not influenced by the dependence of the sound speed c = c(p, 7) 
on the pressure and the concentration of the air since the member c 2 (p, j)w x 
in equation (2.2) in all cases vanishes. From that reason it seems reasonable to 
drop the member f(w) by putting it equal to zero and to investigate the solu- 
tion of the type w = w\x + wo, P = p{t), 7 = j(t) where wq, w\ are constants. 
Then equation (2.1) (with / = 0) is satisfied identically and from (2.2), (2.3) 
we obtain the system of two ordinary differential equations 

p(t) - p c 2 (p(i),7(*)), 
7 (i) = ff(p(t),7(t)), t>0 

with the initial conditions 

p(0) = po = const. 
7(0) = 70 = const. 

The analytic investigation and the numerical solution of this simple system 
can bring the tentative idea of the influence of the dependence of c = c(p, 7) on 
the behaviour of the system and thus to obtain comparative characteristic for 
the solution of the general problem and for the correspondence with the physical 
concept of the behaviour of the system. 
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